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Abstract 


A  binary  de  Bruijn  sequence  has  the  property  that  every  n-tuple  is  distinct 
on  a  given  period  of  length  2".  An  efficient  algorithm  to  generate  a  class  of  classi¬ 
cal  de  Bruijn  sequences  is  given  based  upon  the  distance  between  cycles  within  the 
Good  -  de  Bruijn  digraph.  The  de  Bruijn  property  on  binary  sequences  is  shown 
to  be  a  randomness  property  of  the  ZERO  amd  ONE  run  sequences.  Utilizing  this 
randomness  we  find  additional  new  structure  in  de  Bruijn  sequences.  We  analyze 
binary  sequences  that  are  not  de  Bruijn  but  instead  possess  the  sufficient  structure 
so  that  every  distinct  binary  n-tuple  can  be  systematically  “combed”  out  of  the  se¬ 
quence.  These  complete  or  nonclassical  de  Bruijn  sequences  are  a  generalization  of 
the  well-known  de  Bruijn  cycle. 

Our  investigation  focuses  on  binary  sequences,  called  double  Eulerian  cycles, 
that  define  a  cycle  along  a  graph  (digraph)  visiting  each  edge  (arc)  exactly  twice.  A 
new  algorithm  to  generate  a  class  of  double  Eulerian  cycles  on  graphs  and  digraphs 
is  found.  Double  Eulerian  cycles  along  the  binary  Good  -  de  Bruijn  digraph  are 
partitioned  by  the  run  structure  of  their  defining  sequences.  This  partition  allows  for 
a  statistical  analysis  to  determine  the  relative  size  of  the  set  of  complete  cycles  defined 
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by  the  sequences  we  study.  A  measure  that  categorizes  double  Eulerian  cycles  along 
graphs  (digraphs)  by  the  distance  between  the  two  visitations  of  each  edge  (arc)  is 
provided.  An  algorithm  to  generate  double  Eulerian  cycles  of  minimum  measure  is 
given. 
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I.  PERSPECTIVE 

“Begin  at  the  beginning,  ”  the  King  said,  very  gravely,  “and  go  on  till  you  come 
to  the  end:  then  stop.” 

Alice’s  Adventures  in  Wonderland,  Chapter  12 

A.  OVERVIEW 

The  advent  of  modern  high-speed  communication  hardware  creates  a  need  for 
high-speed  techniques  to  generate  random-like  sequences.  Most  digital  computers  and 
many  communication  systems  handle  information  in  binary  form.  One  of  the  sim¬ 
plest  and  most  efficient  devices  for  generating  deterministic,  random  looking  binary 
sequences  is  the  shift  register.  Every  periodic  binary  sequence  is  obtainable  from 
some  suitably  constructed  shift  register.  This  generality  allows  great  versatility  in 
shift  register  applications.  The  applications  for  shift  register  sequences  include  secure 
data  transmission  [Ref.  1],  robot  path  planning  [Ref.  2],  multiple  address  coding 
[Ref.  3],  error  correcting  codes  [Ref.  4],  radar  range  measuring  [Ref.  3],  and  random 
number  generation  [Ref.  5]. 

It  is  well-known  that  a  (2"  —  l)-long  sequence,  i.e.,  a  binary  sequence  of  length 
2”  —  1,  contauning  all  the  non-zero  binary  n-tuples  can  be  obtauned  from  an  n-span 
shift  register  by  means  of  a  feedback  function  consisting  entirely  of  modulo  2  aulditions 
[Ref.  6].  Such  functions  aure  called  linear.  A  great  dead  of  theory  has  been  developed 
about  linear  feedbau:k  shift  registers.  Finite  fields  provide  the  underlying  mathemat- 
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ical  foundation  for  the  linear  feedback  shift  register.  Once  we  remove  the  restriction 
that  the  feedback  function  be  linear,  allowing  products  of  variables,  the  fundamen¬ 
tal  structure  of  the  shift  rigister  changes  dramatically.  As  we  expand  from  linear 
to  nonlinear  feedback  functions  the  number  of  maximum  length  binary  shift  register 
sequences  of  degree  n,  2"-long  sequences  with  distinct  n-tuples,  increases  from  less 
than  ^  to  exactly  .  Unlike  in  the  linear  case,  for  a  general  nonlinear  feedback 
function  it  is  often  true  that  the  best  way  to  determine  the  resulting  sequence  is  to 
exhaustively  construct  the  state  tree,  since  there  are  few  algebraic  approaches  for 
the  nonlinear  analysis.  Many  fundamental  questions  remain  to  be  answered  for  the 
nonlinear  problem.  There  is  a  need  for  expanded  techniques  and  tools. 

In  a  sense,  no  finite  length  sequence  is  ever  truly  random.  In  particular,  no 
sequence  that  depends  on  a  rather  small  number  of  parameters,  such  as  the  feedback 
connections  of  a  feedback  shift  register,  can  be  considered  truly  random.  These 
sequences,  however,  have  the  balance  and  run  randomness  properties  expected  of 
random  sequences  as  defined  by  Golomb  {Ref.  6].  We  consider  only  sequences  of 
finite  length.  Fxirthermore,  the  end  of  the  sequence  is  considered  to  be  contiguous 
with  the  beginning  of  the  sequence,  hence,  sequences  and  cycles  can  be  considered  to 
be  equivalent. 

B.  CURRENT  RESEARCH 

An  important  problem  currently  under  consideration  with  respect  to  nonlinear 
sequences  is  that  of  finding  a  broad  class  of  functions  that  yield  near-maximal  length 
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cycles.  Ford  [Ref.  7],  Lempel  [Ref.  8],  and  Fredricksen  [Ref.  9],  among  others,  have 
provided  methods  for  constructing  de  Bruijn  cycles  and  others  have  expanded  upon 
their  ideas.  Additionally,  classes  of  functions  that  yield  pairs  of  cycles  of  length  2"“' 
are  examined  by  Kibler  [Ref.  10].  But  here,  as  in  so  many  other  facets  of  the  analysis 
of  nonlinear  functions,  even  though  the  structure  suggests  some  underlying  means  of 
classifying  these  functions,  the  discriminating  factor  has  proven  to  be  elusive. 

A  de  Bruijn  cycle  (or  de  Bruijn  sequence)  of  length  2"  has  the  property  that  ev¬ 
ery  n-tuple  appears  exactly  once  on  a  given  period.  For  some  applications  it  might  not 
be  necessary  that  the  n  bits  of  interest  lie  consecutively  along  the  sequence.  Our  work 
examines  an  apparently  unexplored  area  with  respect  to  de  Bruijn  cycles.  Although 
novel,  this  research  has  parallels  to  the  current  direction  of  study  to  find  classes  of 
de  Bruijn  cycles.  We  analyze  sequences  of  length  2”  that  are  not  de  Bruijn  but 
that  have  been  found  to  possess  sufficient  structure  so  that  every  distinct  binary 
n-tuple  can  be  systematically  “combed”  out  of  the  sequence  -  i.e.,  every  n-tuple  can 
be  found  appearing  at  the  successive  positions  in  a  subsequence.  We  find  that  in  this 
way  the  properties  that  make  de  Bruijn  cycles  so  attractive  can  be  extracted  from 
sequences  that  apparently  do  not  contain  each  n-tuple. 

C.  DESCRIPTION  OF  THE  THESIS 

A  simple  method  to  extract  n-tuples  from  a  sequence  is  to  cycle  the  sequence 
through  the  pure  cycling  register  of  length  2”  {PC R^n)  [Ref.  6],  recording  the  bits  of  n 
arbitrary  (not  necessarily  consecutive)  but  fixed  registers  at  each  shift  of  the  PC Rr*- 
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More  precisely,  we  define  a  functional  ^(/i,  /a,  •  - . ,  /a"),  whose  domain  consists  of  the 
2"  functions  (/i,/3,  • . .  ,/2»)-  The  function  /,  projects  the/  bits  (s,,  s,+i, . . . 

/  >  n,  from  a  2'*-long  binary  sequence  (si,  Sa, . . . ,  Sa" )  to  a  binary  n-tuple.  We  restrict 
each  of  the  functions  fi  in  T  to  be  the  projection  from  to  B"  defined  by 

yi(>S»i  “Si+Ii  •  •  *  1  l)  —  ^}i » •  •  •  5  ^in~2 » '*«+/— 1 )) 

where  i  <  ji  <  •  •  •  <  jn-j  <  *  +  /  —  1. 

It  is  useful  to  visualize  each  fi  as  an  /-long  window  containing  n  viewing 
stations  or  as  a  comb  of  length  /  with  n  surviving  teeth,  denoted  as  an  {^,n)-coTnb. 
It  should  be  noted  that  an  (/,  n)-comb  is  always  to  be  applied  to  a  sequence  of  length 
2"  since  our  intent  is  to  extract  2"  unique  n-tuples. 

The  set  of  all  2**-long  balanced  sequences,  i.e.,  those  with  an  equal  number  of 
I’s  and  O’s,  is  denoted  by  Sn-  If  5  ==  0001010001101111,  then  5  is  an  element  of 
that  is  clearly  not  de  Bruijn,  i.e.,  4-tuples  are  repeated.  Hence,  the  (4,4)-comb 
consisting  of  4  consecutive  teeth  is  not  a  comb  for  5.  The  (8,4)-comb  specified  by 

5«+l»  ^»+3>  ^»+5»  ^*+S»  ~  ^i+l  >  ^»+2i  '®»+7)» 

however,  generates  the  following  sequence  of  distinct  4-tuples  from  5:  (0000,  0010, 
0101,  1011,  0100,  1001,  0001,  0011,  0111,  1100,  1010,  0110,  nil,  llio,  llOl,  lOOO). 

It  is  convenient  to  depict  combs  by  a  symbol  that  portrays  the  teeth  of  the 
comb  in  a  fairly  obvious  way.  For  example,  the  (8,4)-comb  specified  above  by 

/»(^«»  ^«+2»  ®«+3»  ^i+4»  ^«+6»  ^»+6»  ^i+r)  —  5«+li  ^»+7) 
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is  represented  by  the  symbol  HI****!.  It  can  easily  be  verified  that  5  has  the  following 
(/,4)-combs:  l•l****l***l.  l*l***l*«**l.  ^*l••**^****l■  (Note  that  these  combs  are 

distinct  combs  and  are  not  cyclic  shifts  of  each  other.) 

We  see  that  a  sequence  can  have  several  associated  combs.  Some  sequences, 
however,  have  exactly  one  comb  while  other  sequences  have  no  combs  at  all.  Clearly, 
a  sequence  of  length  2"  composed  of  a  de  Bruijn  cycle  of  length  2""^  concatenated 
with  itself  cannot  have  a  comb  since,  for  each  t, 

/(Si,  Sj+l,  .  .  .  ,S,+<_i)  =  /(Sj^.2n-J,Si+l+2n-l,...,Sj+/_i+2n-j). 

For  the  same  reason  any  sequence  of  length  2"  possessing  a  periodic  part  of  period  d, 
where  d  is  a  proper  divisor  of  2",  cannot  have  a  comb. 

Fundamentally,  the  search  for  sequences  and  combs  can  proceed  in  either  of 
two  ways:  either  by  finding  a  sequence  satisfying  a  particular  comb  or  by  finding  a 
comb  that  satisfies  a  particular  sequence.  In  either  case,  the  appropriate  sequences 
are  necessarily  of  length  2”  zmd  are  therefore  generated  by  polynomials,  over  the  field 
of  two  elements,  of  the  form  fk{x)  =  (x  +  1)*  for  some  2”"‘  <  fc  <  2"  [Ref.  6].  Only 
those  sequences  from  this  set  that  are  balanced  need  be  considered. 

Occasionadly,  we  categorize  an  (/,  n)-comb  as  an  m-comb  if  at  most  m  of  its  n 
teeth  are  consecutive.  We  establish  a  one-to^ne  correspondence  between  sequences 
and  walks  along  a  graph  or  digraph.  We  show  that  a  sequence  of  length  2**  with  an 
(n  —  k)  comb,  I  <k  <n,  traverses  each  arc  in  the  digraph  exactly  2*  times. 

(The  digraph  "Bn-ik+i)  is  isomorphic  to  the  Good  -  de  Bruijn  digraph  B„_(*+i).)  This 
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property  has  fw-reaching  implica  >ns.  We  find  that  walks  traversing  each  arc  of  the 
Good  -  de  Bruijn  digraph  provide  a  common  thread  throughout  each  of  the  following 
chapters.  The  set  of  walks  that  visit  each  edge  (arc)  in  a  graph  (digraph)  exactly  7 
times  is  denoted  by  W.^. 

The  primary  goal  of  the  research  presented  here  is  to  provide  a  better  under¬ 
standing  of  the  de  Bruijn  property  of  distinct  n-tuples,  binary  de  Bruijn  cycles,  and 
the  Good  -  de  Bruijn  digraph.  The  thesis  consists  of  7  chapters: 

Chapter  II:  The  purpose  of  this  chapter  is  to  make  the  thesis  self-contained. 

We  provide  a  concise  introduction  to  graph  theory,  feedback  shift  registers,  and 
their  resulting  sequences.  This  introduction  is  suited  for  the  analysis  of  double 
Eulerian  cycles,  binary  de  Bruijn  cycles,  and  the  Good  -  de  Bruijn  digraph. 
Apart  from  basic  definitions.  Chapter  II  also  introduces  definitions  that  are  not 
in  common  use  but  proved  to  be  convenient  in  the  nonlinear  theory  of  periodic 
sequences.  We  introduce  some  initial  analysis  on  double  Eulerian  cycles^  cycles 
that  traverse  each  edge  (arc)  along  a  graph  (digraph)  exactly  twice. 

Chapter  III:  We  investigate  sequences  called  complete  cycles  or  nonclassical 
de  Bruijn  cycles.  A  complete  cycle  of  length  2”  has  the  property  that  each  of 
the  possible  2"  binary  n-tuples  lies  along  a  fixed  pattern  or  (/,  n)-comb  of  the 
sequence.  The  analysis  of  these  complete  cycles  is  primarily  concerned  with 
combs  where  n — 1  of  the  bits  of  interest  lie  consecutively  along  the  sequence.  A 
characterization  of  a  class  of  complete  cycles  is  made  in  terms  of  the  walk  they 


define  along  an  appropriate  Good  -  de  Bruijn  digraph.  A  statistical  analysis 
is  made  to  determine  the  number  of  complete  cycles  defined  by  a  particular 
class  of  sequences. 

Chapter  IV:  To  analyze  (n— l)-combs,  i.e.,  those  with  n  — 1  consecutive  teeth, 
some  essential  theory  on  double  Eulerian  cycles  adong  graphs  and  digraphs  is 
developed.  We  define  a  mtasurt  that  categorizes  double  Eulerian  cycles  by 
a  function  of  the  two  visitations  along  each  edge  in  the  traversed  graph.  In 
essence,  the  measure  describes  where  particular  n-tuples  are  located  on  the 
sequence  in  some  rough  sense.  This  measure  pau'allels  the  discrete  logarithm 
problem  of  finite  fields  [Ref.  11]  that  forms  the  foundation  of  some  current  pub¬ 
lic  key  cryptography  systems.  A  conjecture  is  given  for  the  minimum  measure 
(or  value)  of  a  Good  -  de  Bruijn  digraph. 

Chapter  V:  We  describe  how  the  de  Bruijn  property  of  distinct  n-tuples  re¬ 
sults  from  a  randomness  property  of  the  run  lengths  in  a  binary  sequence.  The 
run  structure  of  the  sequences  defining  am  Euleriam  or  double  Euleriam  cycle 
along  the  Good  -  de  Bruijn  digraph  is  completely  determined.  A  statisticad 
amadysis  shows  that  the  property  of  contauning  distinct  n-tuples  in  a  binary  de 
Bruijn  cycle  of  length  2"  is  equivalent  to  the  Expected  Value  Property  for  run 
lengths  in  a  random  binauy  sequence. 
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Chapter  VI:  The  concepts  derived  in  Chapter  IV  are  used  to  develop  a  new 


algorithm  to  generate  classical  de  Bruijn  cycles.  The  algorithm  is  bcised  on 
the  edge-factors  of  the  Good  -  de  Bruijn  digraph  and  the  distance  between 
cycles.  The  edge-factors  of  a  graph  are  found  to  be  an  important  element  to 
determine  the  minimum  measure  of  a  graph  or  digraph. 

Chapter  VII:  Here  we  summarize  the  results  of  the  thesis.  Directions  for 
further  research  are  identified  and  open  problems  are  discussed. 

In  summary,  the  ensuing  chapters  emerge  from  the  development  of  combing 
sequences  in  Chapter  III.  More  importantly,  each  result  is  inspired  by  the  desire  to 
gain  a  greater  understanding  of  nonlinear  binary  sequences  in  general. 

D.  HISTORICAL  NOTE 

PROBLEM:  Given  m  symbols  (which,  without  loss  of  generality,  we  take  to  be 
0,  1,  ••• ,  m-1)  and  a  positive  integer  n,  find  a  sequence  of  these  symbols  having 
minimum  length,  that  when  arranged  as  a  cycle,  contains  every  sequence  of  n 
consecutive  symbols. 

A  solution  to  this  problem  is  a  de  Bruijn  cycle  of  length  (m)".  In  1951  van 
Aardenne-Ehrenfest  and  de  Bruijn  [Ref.  12]  showed  that  de  Bruijn  cycles  exist  for 
all  m  >  2  and  n  >  1.  In  fact  the  de  Bruijn  cycle  problem  has  been  independently 
rediscovered  many  times. 
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The  problem  of  finding  de  Bruijn  cycles  for  m  =  2  became  well-known  through 
de  Bruijn’s  paper  [Ref.  13],  where  the  number  of  solutions  was  found  to  be  2^"”'“". 
de  Bruijn  [Ref.  14],  however,  credits  Stanley  for  discovering  that  the  problem  had  been 
proposed  and  solved  half  a  century  earlier  in  the  French  problem  journal  I’Intermediaire 
its  Mathematiciens  in  1894.  The  problem  was  proposed  by  de  Riviere  in  1894  and 
solved  by  Flj  3  Sainte-Marie  [Ref.  15]  that  same  year.  Flye  Sainte-Marie  found  the 
same  number,  and  his  method  of  solution  was  paralleled  by  de  Bruijn.  Three 

yeeirs  later,  Mantel  [Ref.  16]  found  a  solution  whenever  m  is  prime  using  an  algebraic 
method.  After  1897,  the  problem  was  apparently  entirely  forgotten  until  1934,  when 
it  was  reintroduced  by  Martin  [Ref.  17].  Martin  approached  the  problem  combina- 
torially  and  proved  the  existence  of  de  Bruijn  cycles  for  all  m  and  n  by  creating  an 
algorithm  to  construct  such  cycles.  A  decade  after  Maurtin,  de  Bruijn  [Ref.  13]  and 
Good  [Ref.  18]  independently  rediscovered  and  solved  the  problem  for  the  case  m  =  2 
using  graph  theoretic  and  group  theoretic  concepts. 

The  corresponding  problem  for  m  >  2  symbols  was  first  raised  and  solved 
in  1951  [Ref.  12].  The  number  of  solutions  was  found  to  be  using 

methods  of  determinants  on  the  adjacency  matrix  representing  the  Good  -  de  Bruijn 
digraph. 

Subsequently,  algorithms  to  generate  some  or  all  of  the  de  Bruijn  cycles  of 
length  2**  have  been  repeatedly  uncovered.  It  is  a  palatial  problem  that  will  undoubt¬ 
edly  lure  attempts  to  unravel  it  again  and  again. 
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II 


DEFINITIONS  AND  NOTATION 


“...But  do  cats  eat  bats,  /  wonder?”  And  here  Alice  began  to  get  rather 
sleepy,  and  went  on  saying  to  herself,  in  a  dreamy  sort  of  way,  “Do  cats  eat 
bats?  Do  cats  eat  bats?”  and  sometimes  “Do  bats  eat  cats?”  for  you  see,  as 
she  couldn’t  answer  either  question,  it  didn’t  much  matter  which  way  she  put 
it. 


Alice’s  adventures  in  Wonderland,  Chapter  1 


A.  INTRODUCTION 

Before  describing  double  Eulerian  cycles,  de  Bruijn  cycles,  and  the  methods 
and  techniques  we  need  to  address  in  our  work,  it  is  necessary  to  provide  applicable 
definitions  and  notation.  A  detailed  description  of  linear  and  nonlinear  shift  register 
sequences  appears  in  Golomb  [Ref.  6].  A  text  on  graph  theory  such  as  Bondy  and 
Murty  [Ref.  19],  provides  a  thorough  discussion  on  the  relevant  material  on  graph 
theory.  The  reader  who  is  thoroughly  familiar  with  the  vocabulary  and  concepts  of 
graph  theory  and  de  Bruijn  cycles  may  wish  to  skip  most  of  this  chapter. 

B.  GRAPHS 

A  graph  G  consists  of  a  set  V  =  {ui, V2, . . . ,T;p},  of  elements  called  vertices 
(or  nodes)  and  a  set  £  =  {ei,C2, . . .  ,e,}  of  unordered  pairs  of  vertices  called  edges. 
The  edge  between  vertex  x  and  vertex  y  is  written  as  xy  or  yx.  The  graph  G  is  said 
to  have  order  p  and  size  q.  We  write  G  =  (V,  E)  and  say  V  is  the  vertex  set  and  E 
is  the  edge  set. 
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Edges  of  the  form  e  =  ViVi  are  called  loops.  A  graph  without  loops  is  called 
simple  if  it  also  has  no  pair  of  vertices  forming  more  than  one  edge.  A  graph  is  finite 
if  its  order  is  finite.  We  will  consider  only  graphs  that  aje  finite,  but  not  necessarily 
simple. 

Edge  e  =  ViVj  is  said  to  be  incident  to  the  vertices  u,  and  Vj.  Similarly,  u,  and 
Vj  are  incident  to  the  edge  e.  Two  vertices  that  are  incident  to  a  common  edge  are 
adjacent.,  as  are  two  edges  that  are  incident  to  a  common  vertex.  A  set  of  vertices 
(edges)  is  independent  if  the  vertices  (edges)  are  mutually  nonadjacent.  The  graph 
G'  ~G  —  {u}  is  the  graph  G  with  the  vertex  v  and  its  incident  edges  removed. 

A  subgraph  of  a  graph  G  =  {V,E)  is  a,  graph  H  =  {V\  E')  where  V  CV  zmd 
E'  C  E.  Supfiose  that  V'  is  a  nonempty  subset  of  V.  The  subgraph  of  G  whose  vertex 
set  is  V'  and  whose  edge  set  is  all  of  those  edges  of  G  for  which  both  incident  vertices 
are  in  V  is  called  the  subgraph  of  G  induced  by  V\  Suppose  that  E'  is  a  nonempty 
subset  of  E.  The  subgraph  of  G  whose  vertex  set  is  the  set  of  2dl  vertices  incident  to 
the  edges  of  E'  and  whose  edge  set  is  E'  is  called  the  subgraph  of  G  induced  by  E'. 

A  waUt  W  =  (ui,Ci,t;3,cj, . . .,Cfc_i,t;*)  in  G  is  an  alternating  sequence  of 
vertices  and  incident  edges,  beginning  and  ending  on  a  vertex  of  G.  A  walk  may 
have  repeated  edges  and  repeated  vertices.  The  length  of  a  walk  is  the  number  of 
edges  in  the  sequence.  A  walk  can  also  be  identified  by  merely  listing  the  sequence 
of  adjacent  edges  or,  in  a  graph  without  multiple  edges,  by  listing  the  sequence  of 
adjacent  vertices. 
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Figure  1.  Graphs  and  a  simple  graph 

A  walk  is  said  to  be  closed  if  the  initial  and  terminal  vertices  zu'e  the  same. 

A  closed  walk  is  called  a  cycle.  When  it  is  not  important  to  designate  the  initial 
vertex  of  a  cycle,  the  terminal  vertex  will  typically  not  be  listed  in  the  sequence  of 
adjacent  vertices.  Using  this  notation,  any  cyclic  shift  of  a  cycle  is  considered  to  be 
the  same  cycle.  Further,  this  allows  a  distance  to  be  defined  between  the  vertices 
(edges)  of  a  simple  cycle.  The  distance  between  the  vertices  u,-  amd  Vj  on  a  simple 
cycle  C  =  {vq,  Vi, . . . ,  Up-i)  is  denoted 

Vj)  =  min  ((t  -  j)  mod  p,  {j  -  i)  mod  p) . 

Similarly,  the  distance  between  the  edges  e,  and  ej  on  a  simple  cycle  C  =  (co,  ei, . . . ,  Cp_i) 
is 

ej)  =  min  ((i  -  j)  mod  p,  {j  -  i)  mod  p) . 

A  walk  with  distinct  edges  is  called  a  trail.  If  the  vertices  of  a  trail  are  distinct, 
the  walk  is  called  a  path.  A  closed  path  is  called  a  simple  cycle.  If 

(ui,ci,U2,e2,...,Cfc_i,t;*) 
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Figure  2.  Cycles 

is  a  walk,  vi  is  the  initial  vertex,  the  set  of  interior  vertices  is  {uj,  Vk-2,  Ufe-i}, 

and  Vk  is  the  terminal  vertex  of  the  walk.  That  is,  all  of  the  vertices  of  the  walk 
that  occur  somewhere  other  than  as  the  initial  and  terminal  vertices  of  the  walk  are 
interior.  The  set  of  all  walks  along  a  graph  begirming  at  vertex  v  is  denoted  by  Wv. 
The  set  of  all  walks  in  which  each  of  the  edges  appears  exactly  7  times  is  denoted  by 
VUy.  Additionally,  the  set  of  all  walks  in  where  each  of  the  edges  appears  exactly 
twice  is  denoted  by  Wj, . 

A  cycle  C  of  period  p  contains  p  edges  and  we  write  \C\  =  p.  If  C  is  a  cycle, 
but  not  a  simple  cycle,  then  C  is  sjud  to  be  reducible.  Evidently,  a  cycle  is  reducible 
if  it  is  not  a  closed  path.  The  closed  path  (vi,U2,i;3,U4,U5,t;6)  in  Figure  2  is  both  a 
cycle  and  a  simple  cycle.  The  closed  trail  C  =  {v2,  ve,  us,  us,  Vg,  vi)  is  a  cycle  but  not 
a  simple  cycle,  i.e.,  C  is  reducible.  The  cycle  C  can  be  reduced  to  the  two  simple 
cycles  (u2,U6,i;i)  and  (v65U3,V5). 

The  degree  of  a  vertex  v  in  a  graph,  denoted  deg(v),  is  the  number  of  edges 
incident  to  v.  A  loop  at  a  vertex  contributes  two  to  the  degree  of  that  vertex. 
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A  graph  G  is  said  to  be  connected  if,  for  all  i,j,  there  exists  a  path  connecting 
Vi  and  Vj.  A  graph  that  is  not  connected  is  said  to  be  disconnected. 

A  trail  in  a  graph  G  is  called  Eulerian  provided  it  contains  every  edge  of  G 
exactly  once.  An  Eulerian  cycle  in  G  is  a  cycle  that  includes  each  edge  of  G  exactly 
once.  A  walk  in  a  graph  G  is  called  double  Eulerian  if  it  contains  every  edge  in  G 
exactly  twice.  A  path  in  a  graph  G  is  called  Hamiltonian  provided  that  it  contains 
every  vertex  of  G  exactly  once.  A  Hamiltonian  cycle  in  G  is  a  cycle  that  includes 
each  vertex  of  G  exactly  once. 

The  following  theorem  is  a  well-known  result  in  graph  theory.  See,  for  exsunple, 
Bondy  and  Murty  [Ref.  19]. 

Theorem  II.  1  A  connected  graph  G  has  a  closed  Eulerian  trail  if  and  only  if 
the  degree  of  each  vertex  is  even. 

A  partition  of  a  set  X  is  a  family  {Xi|*  €  /  C  Z"*"}  of  non-empty  subsets  of  X 
such  that  X  =  IJ  Xi  and  the  family  is  pairwise  disjoint. 

t€/ 

A  factor  of  a  connected  graph  G  =  (K,  E)  is  a  set  of  cycles  in  G  that  induces  a 
partition  of  V.  In  a  similar  manner,  an  edge-factor  of  a  connected  graph  G  is  a  set  of 
cycles  in  G  that  induces  a  partition  of  E.  The  set  of  cycles  {(vi,  ua,  ue),  (us,  U4,  us)}  is  a 
factor  of  the  graph  G  in  Figure  2.  The  set  of  cycles  {(vi,  V3,  vg),  (va,  v^,  us),  {v2,  us,  v^,  ua)} 
is  an  edge-factor  of  G. 
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Figure  3.  A  graph  with  a  reducible  cycle 

The  subgraph  induced  by  the  edges  of  a  reducible  cycle  has  a  nontrivial  edge- 
factor.  For  example,  let  the  edge-factor  F  of  G  in  Figure  3  consist  of  the  cycles 
Cl  =  (u2,U6,U3,U6,U6jUi)  and  C2  =  (v2iVz,v^,Vi).  The  subgraph  /f  of  G  in  Figure  3 
is  induced  by  the  edges  of  Ci.  Since  Ci  is  reducible,  H  has  a  nontrivial  edge-factor  con¬ 
sisting  of  the  cycles  A  =  (02,061^1)  and  B  =  The  set  of  cycles  {C2,  A,  B} 

is  an  edge-factor  of  G,  denoted  by  Fa,B’ 

A  tree  is  a  connected  acyclic  graph,  i.e.,  the  path  connecting  any  two  vertices 
is  unique.  Any  vertex  in  a  tree  can  be  distinguished  as  the  root  of  the  tree.  Every 
path  from  the  root  has  a  last  or  terminal  vertex.  The  height  of  a  rooted  tree  is  the 
length  of  the  longest  path  from  the  root. 
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Two  graphs  G\  =  (V^i,£i)  and  G2  =  (Vj,  £^2)  are  isomorphic  if  there  is  a 
bijection  /  from  Vi  to  Vi  such  that  /(x)/(y)  is  an  edge  of  Gj  if  and  only  if  xy  is  an 
edge  of  Gi. 

The  distance^  d(x,y),  between  two  vertices  x  and  y  in  a  connected  graph  G  is 
the  minimum  length  of  a  path  joining  them.  The  edge-distance,  de(v,e),  between  a 
vertex  v  and  an  edge  c  =  xy  in  G  is 

de(v,e)  =  min(d(v,x),d(v,y)). 

Given  a  subset  X  C  V  of  vertices  in  G  =  (V,  E),  the  distance  between  a  vertex  v  E  V 
and  X  is 

d(v,X)  =  mind(v,x).  (Ill) 

Similarly,  given  a  subset  Y  C  E  of  edges,  the  edge-distance  from  v  to  K  is 

Y)  =  mnde(u,  y).  (II.2) 

Let  G  =  (V,  be  a  connected  graph.  The  status,  of  a  vertex  v  in  G  is 
defined  by  Sy  =  ^  d{v,n).  The  median  M{G)  of  a  graph  G  is  the  set  of  vertices 

n€V 

of  minimum  status.  In  a  similar  manner,  the  edge-status,  es^,  of  a  vertex  v  €  G  is 

defined  as  eSy  =  ^  d,{v,e).  The  edge-median  EM(G)  of  a  graph  G  is  the  set  of 
e6£ 

vertices  of  minimum  edge-status. 

For  example,  in  Figure  4  for  the  graph  G  we  find  Sv,  =  9,  =  8,  es„,  =  4, 

csv2  =  4,  Af(G)  =  {t;2}»  and  EM{G)  =  {wi,V2}. 
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Figure  4.  A  graph  for  which  M{G)  /  EM{G) 


Figure  5.  Digraphs 


C.  DIGRAPHS 

A  digraph  (or  directed  graph)  D  =  {V,  A)  has  a  set  K  of  elements  called  vertices 
and  a  set  A  C  V  X  V  of  ordered  pairs  of  (not  necessarily  distinct)  vertices  called  arcs. 
We  think  of  the  arc  a  =  (x,  y)  as  leaving  x  and  entering  y,  that  is,  directed  from  x  to 
y;  X  is  the  inititd  vertex  of  a  and  y  its  terminal  vertex.  A  digraph  may  contain  each 
of  the  arcs  (x,y)  and  (y,x)  as  well  as  loops  of  the  form  (x,x).  A  loop  (x,x)  enters 
and  exits  the  same  vertex  x.  A  general  digraph  may  also  include  mtiltiple  arcs. 


17 


A  vertex  of  the  digraph  D  has  two  associated  degrees.  The  outdegrte  of  a 
vertex  v,  denoted  out(i;),  is  the  number  of  arcs  for  which  v  is  the  initial  vertex.  The 
indegree  of  v,  denoted  in(u),  is  the  number  of  arcs  for  which  i;  is  the  terminal  vertex. 
The  loop  (x,x)  contributes  1  to  each  of  in(x)  and  out(x). 

The  definitions  of  walk,  path,  trail,  factor,  and  cycle  carry  over  from  graphs  to 
digraphs  in  a  fairly  obvious  way.  For  example,  a  directed  walk  in  a  digraph  D  =  iV,A) 
is  a  sequence  of  vertices  and  arcs,  with  the  property  that 

(t;,-,  Ui+i)  €  A  for  1  <  i  <  /:  —  1.  A  directed  walk  is  a  directed  path  if  zdl  of  its  vertices 
are  distinct,  a  directed  trail  if  all  its  arcs  are  distinct,  and  a  closed  directed  walk  if  the 
initial  and  terminal  vertices  are  the  same.  A  closed  directed  walk  is  a  cycle',  a  closed 
directed  path  is  a  simple  cycle. 

For  any  graph  G  =  {V,E)  we  obtsun  a  digraph  D  =  (K,  A)  by  giving  each  edge 
xy  €  E  an  orientation,  that  is,  by  replacing  xy  with  either  (x,  y)  or  (y,  x).  Such  a 
digraph  D  is  called  an  orientation  of  G.  A  graph  has  2*^1  orientations  if  it  has  no 
loops  and  orientations  if  G  has  k  loops.  Conversely,  given  a  digraph  D  =  (V,  A) 
we  can  remove  the  direction  of  its  arcs  thereby  obtaining  a  graph  G  =  (V,  E).  Such  a 
graph  is  called  the  underlying  graph  of  G.  A  digraph  D  has  exactly  one  underlying 
graph,  denoted  by  Gp. 

A  weak  digraph  is  one  that  has  a  connected  underlying  graph.  A  digraph 
D  —  {V,  A)  is  said  to  be  strong  provided  that  for  each  pair  of  distinct  vertices  x,y  €V 
there  is  a  directed  walk  from  x  to  y  and  a  directed  walk  from  y  to  x  (i.e.,  there  is  a 
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Figure  6.  An  illtistration  of  an  edge-factor  of  a  directed  graph 

directed  cycle  that  includes  y  and  x).  A  directed  trail  in  a  digraph  D  =  ( V",  A)  is  called 
Eulerian  provided  that  it  contains  every  arc  of  A  exactly  once.  A  directed  path  is  said 
to  be  Hamiltonian  if  it  contains  every  vertex  of  V  exactly  once.  A  Hamiltonian  cycle 
is  one  that  contains  every  vertex  of  V  exactly  once,  where  the  initial  and  terminal 
vertices  may  be  considered  the  same. 

A  factor  of  a  weak  digraph  D  =  (K,  A)  is  a  set  of  cycles  in  D  that  induce  a 
partition  of  V.  Similarly,  an  edge-factor  of  a  weak  digraph  Z)  is  a  set  of  cycles  in  D 
that  induce  a  partition  of  A. 

As  an  example,  the  set  of  cycles  {(vi),(u2,U3),(v4)}  in  Figure  6  is  a  factor 
of  graph  Bj.  The  set  of  cycles  F  =  {(vi),(vifV2,V3),(v2,V4,V3),(v4)}  constitute  an 
edge-factor  of  ^3. 

For  vertices  x  and  y  in  a  strong  digraph  D,  the  directed  distance  ^^(x,  y)  is 
the  length  of  a  shortest  directed  path  from  x  to  y  in  D.  The  directed  distance  from 
any  vertex  to  itself  is  zero,  i.e.,  do(x,  x)  =  0.  Unless  the  digraph  D  is  symmetric,  it 
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is  not  generally  the  case  that  dD(x,y)  =  dD{y,x)  for  all  vertices  x,y  in  V.  Therefore, 
directed  distance  is  not  in  general  a  metric.  The  distance,  d{x,  y),  between  two  vertices 
X  and  y  in  a  weak  directed  D  digraph  is  the  minimum  length  of  a  path  joining  them 
in  the  underlying  graph  of  D. 

Given  an  edge-factor  F  of  a  digraph  D  =  {V,  A),  the  posture,  Pv{F),  of  a  vertex 
u  6  V  is  defined  by 

Pv{F)=  '£^2\C\de{v,C),  (II.3) 

C€F 

where  |C|  is  the  number  of  edges  in  the  cycle  C,  de{v,C)  =  minde(u,x)  in  Go,  the 

rgC 

underlying  graph  of  D,  and  the  summation  is  over  all  of  the  cycles  of  the  edge-factor. 

The  mean  for  the  factor  F  of  a  directed  graph  D,  denoted  by  is  defined 

to  be  the  set  of  vertices  with  minimum  posture  for  the  edge-factor  F.  As  an  example, 
for  the  edge-factor  F  of  we  find  PviiF)  =  10,  Pv,{F)  =  4,  and  MpiB-i)  —  {v2,  us}* 

D.  GOOD  -  DE  BRUIJN  GRAPHS  AND  DE  BRUIJN 
CYCLES 

The  original  formulation  of  the  de  Bruijn  cycle  problem  for  m  =  2  can  be 
viewed  as  finding  the  number  of  Hamiltonian  paths  in  an  appropriate  directed  graph. 
This  graphical  interpretation  is  very  useful  in  understanding  the  properties  of 
de  Bruijn  cycles. 

A  binary  feedback  shift  register  of  span  n  (FSRn)  is  a  collection  of  n  storage 
devices  (xq,  Xi, . . . ,  x„_i),  each  capable  of  holding  an  element  of  B  =  {0, 1},  together 
with  a  feedback  function  / (xq, x\,..  .  ,x„-i),  taking  on  a  value  of  0  or  1,  computed 


20 


from  the  contents  of  the  n  storage  devices.  The  contents  of  the  register  at  time  t, 
regarded  as  a  binary  n-tuple  or  a  binary  vector,  are  called  the  state  of  the  register.  A 
FSR„  has  2"  possible  states,  namely  the  elements  of  the  set  5"  of  all  binary  n-tuples. 
The  feedback  function  /(x)  of  the  FSRni  where  x  =  (iq,  a:i, . . . ,  in-i),  induces  a 
mapping  F:  B"  B”,  where  f  (x)  =  y  if  and  only  if 

t  =  0, . . . ,  n  2 
f{x);  i  =  n-l. 

At  the  beginning  of  each  time  interval,  determined  by  an  external  clock,  there  is  a 
transition  from  one  state  to  the  next. 

The  superposition  of  ail  possible  state  transition  graphs  for  each  positive  in¬ 
teger  n  defines  the  binary  Good  -  de  Bruijn  digraph  of  order  n,  denoted  by  B„.  See 
Figure  7  for  some  examples  of  de  Bruijn  graphs.  Thus,  the  Good  -  de  Bruijn  di¬ 
graph  Bn  is  a  directed  graph  with  2"  vertices,  each  labeled  with  a  unique  binary 
vector  of  length  n,  and  an  arc  (x,y)  from  vertex  x  =  (xo,xi, . . .  ,Xn_i)  to  vertex 
y  =  (yo,yi,---,yn-i)  if  and  only  if  (xi,X2,. . .  ,x„_i)  =  (yo,yi,...,yn-2).  We  call  y  a 
successor  of  x  and  x  a  predecessor  of  y.  If  (x,y)  €  A,  x  is  adjacent  to  y  in  B„.  The 
conjugate  x  of  the  n-tuple  x  =  (xq, xi, . . . , x„_i)  is  defined  as  x  =  (xo, xi, . . . , x„_i), 
where  x,  =  x,  ©  1  and  ©  denotes  addition  modulo  2.  The  companion  x'  of  the  n-tuple 
X  is  defined  as  x'  =  (xo,xi, . . .  ,Xn-i). 

Each  arc  in  Bn  can  be  viewed  as  an  element  of  the  (n  -f  l)-dimensional  binary 
vector  space  B"'*'^  The  arc  between  the  vertex  (xo, .  • . ,  Xn_i)  and  vertex  (xi, . . . ,  Xn) 
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in  Bn  can  be  labeled  by  the  (n  +  l)-tuple  (xq^Xu- ■  ■  ,Xn-uXn).  The  total  number 
of  arcs  in  Bn  is  2'*'*'*,  and  each  (n  +  l)-tuple  is  assigned  to  a  unique  arc.  With  this 
labeling  we  can  inductively  construct  the  digraph  B„+i.  We  include  an  arc  from 
X  €  to  y  €  B"*^^  in  this  new  graph  Bn+i  whenever  the  terminad  vertex  of  the 
arc  X  €  Bn  is  the  same  as  the  initial  vertex  of  the  arc  y  6  Bn.  We  see  that  a  vertex 
X  is  adjacent  to  a  vertex  y  in  the  induced  digraph  when  the  last  n  coordinates  of 
X  match  the  first  n  coordinates  of  y  in  the  respective  (n  +  l)-tuples  representing  x 
and  y.  This  induced  digraph  is  isomorphic  to  Bn+i  by  the  identity  mapping.  Thus, 
a  closed  Eulerian  trail  of  length  visiting  every  arc  in  B„  defines  a  Hamiltonian 
cycle  visiting  every  vertex  in  Bn+i.  Since  (0,1)  is  a  Hamiltonian  cycle  in  Bi,  and 
there  exists  a  Eulerian  trail  in  Bn,  n  >  1,  we  conclude  that  a  Hamiltonian  cycle  exists 
in  Bn  for  n  >  1. 

Another  way  to  label  the  arcs  in  the  digraph  Bn  is  to  simply  label  the  arc  going 
from  the  vertex  (xo,  •  •  • ,  Xn-i)  to  the  vertex  (xj, . . . , in)  with  the  single  bit  in.  The 
set  of  labels  encountered  as  we  trace  the  arcs  along  a  Hamiltonian  cycle  (path)  in  Bn 
generates  a  binary  de  Bruijn  cycle  (sequence).  Such  a  cycle  is  periodic  of  period  2" 
and  contains  each  of  the  2"  different  binziry  n-tuples  exaM;tIy  one  time  in  each  period 
of  the  sequence.  The  set  of  all  de  Bruijn  cycles  of  length  2"  is  denoted  as  Sn- 

A  subsequence  of  n  consecutive  terms  (or  an  n-sequence)  from  a  sequence 
S  =  (3i,S2,...,S3ft)  is  a  string  of  the  form  (s„s,+i,. . . ,s,+n-i),  where  we  use  the 
convention  that  the  subscripts  are  taken  modulo  2**.  That  is,  we  allow  an  n-sequence 
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of  the  form 


(•^2"— j )  ^2"— j+1 1  •  •  •  )  ^2"  >  ^1 »  ^2*  •  •  •  »  ^n— j  — 1  ))  j  0,1,. ..,n  2, 

which  “wraps  around”  from  the  end  of  5  to  the  beginning.  Any  circular  permutation 

of  5  is  considered  to  be  the  same  sequence. 

A  cycle  of  period  p  in  is  aji  ordered  set  of  distinct  vertices  (states) 

(uo,  Ui, . . . ,  Vp-i),  such  that  is  the  successor  of  t>„  i.e.,  u,+i  =  F(t;,), 

i  =  0, 1, . . ,  ,p  —  2,  and  vq  =  F(vp-i)  .  Clearly,  an  edge-factor  in  Bn  is  isomorphic  to 
a  factor  in  fln+i  • 

In  addition  to  having  two  successors,  each  vertex  in  Bn  has  excictly  two  pre¬ 
decessors.  Moreover,  if  vertex  x  is  a  predecessor  of  vertex  y,  the  following  is  always 
true: 

1.  X  is  a  predecessor  of  y. 

2.  X  is  a  predecessor  of  y'. 

3.  X  is  a  predecessor  of  y'. 

The  four  vertices  x,  x,  y,  and  y'  are  commonly  called  an  adjacency  quadruple.  An  arc 
(x,y)  is  directed  from  vertex  x  to  vertex  y  and  called  incident  with  both  x  and  y. 
Conversely,  x  and  y  are  incident  to  the  arc  (x,y).  A  vertex  x  =  (xo,xi, . . .  ,x„_i)  in 
Bn  is  incident  to  the  arcs  (0,  xq,  xi, . . . , x„_i),  (1, xq,  xj, . . , ,  x„_i),  (xq,  xi, . . . ,  x„_i,  0), 
and  (xo,Xi, . .  •  ,Xn-i,  1)  that  constitute  an  adjacency  quadruple  in  Bn.^.l. 

In  zm  edge-factor  of  B„,  each  arc  uniquely  identifies  its  cycle.  The  cycle  to 
which  the  arc  a  belongs  is  denoted  by  (a).  The  set  of  incident  vertices  for  a  given 
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cycle  (a)  is  denoted  by  (a),,.  Two  cycles  (a)  and  {/3)  are  adjacent  if  they  are  arc 
disjoint  and  there  exists  an  arc  7  on  (a)  whose  conjugate  7  is  on  (/?)  (i.e.,  a,  0  are 
both  incident  to  same  vertex  u).  It  follows  that  (a)  and  {0)  are  adjacent  whenever 
(q)  and  {0)  are  disjoint  and  (q)„  H  (/3)„  0.  Consequently,  each  vertex  is  incident  to 

2  distinct  cycles  in  an  edge-factor  consisting  of  simple  cycles. 

Furthermore,  let  y(o!)v  denote  the  set  of  vertices  t?,  €  (o!)v  for  which 
<i(y^Vi)  =  d{y,  (a)„).  From  each  set  y(a)„  a  specific  vertex  Uj  €  y(Q!)v  is  designated  as 
the  representative  vertex  of  (a),  denoted  by  u,-  =  p(Q)ji. 

Recall  that  the  distance,  d(x,y),  between  two  vertices  x  and  y  in  a  weak 
digraph  is  the  minimum  length  of  a  walk  joining  them  in  the  underlying  graph.  From 
Equation  II.  1  on  page  16,  the  distance,  d(y,  (a)v),  between  a  vertex  y  and  the  set  of 
vertices  (ojv  in  B„  equals  min  {d(y,x)}. 

x€(a)v 

One  factor  occurs  so  often  that  it  has  been  given  a  special  name.  The  Pure 

Cycling  Register  Factor  consists  of  cycles  formed  by  the  cyclic  rotation  of  the  bits 

in  the  Pure  Cycling  Register,  PCRn.  Golomb  [Ref.  6]  shows  that  the  number  of 

cycles,  Z(n),  in  the  PCRn  is  given  by  Z{n)  =  —  where  4>  is  Euler’s 

n 

a|n 

totient  function  and  the  summation  is  over  the  divisors  of  n.  See  Table  I  for  values  of 
<i>{n)  and  Z{n).  Mykkeltveit  [Ref.  20]  shows  that  no  shift  register  (linear  or  nonlinear) 
of  length  n  can  generate  a  factor  containing  more  than  Z{n)  cycles. 
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n 

<^(n) 

2(n) 

1 

1 

2 

2 

1 

3 

3 

2 

4 

4 

2 

6 

5 

4 

8 

6 

2 

14 

7 

6 

20 

8 

4 

36 

9 

6 

60 

10 

4 

108 

11 

10 

188 

12 

4 

352 

Table  I.  Values  of  Euler’s  function  <f>{n)  and  of  the  cycle  function  Z(n),  1  <  n  <  12 
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III.  COMBING  SEQUENCES 


“/  want  a  clean  cup,  ”  interrupted  the  Hatter.  “Let ’s  all  move  one  place  on.  ” 
He  moved  on  as  he  spoke,  and  the  Dormouse  followed  him.  The  March  Hare 
moved  into  the  Dormouse ’s  place  and  Alice  rather  unwillingly  took  the  place 
of  the  March  Hare.  The  Hatter  was  the  only  one  who  got  any  advantage  from 
the  change. 


Alice’s  Adventures  in  Wonderland,  Chapter  7 


A.  INTRODUCTION 

In  Chapter  I ,  Section  C,  we  introduced  the  functional  ^(/i,  /a,  •  •  • ,  /2" )»  whose 
domain  consists  of  the  2"  projections  from  B*  to  B**  defined  on  a  sequence  of  period 
2"  by 

/i('S«5  ^»+l}  •  •  •  »  l)  “  ('®«>  >  *  *  •  )  ^jn-2  >  ^i+/— 1 )? 

where  i  <  ji  <  ■••  <  j„_2  <  *  +  /  -  1. 

The  function  /<  can  be  viewed  as  an  /-long  window  containing  n  viewing  sta¬ 
tions  or  as  comb  of  length  /  with  n  surviving  teeth,  which  we  call  an  (/,n)-comb. 
Occasionally,  we  categorize  such  a  comb  by  its  longest  string  of  consecutive  teeth. 
An  m-comb  denotes  a  comb  with  at  most  m  consecutive  teeth.  It  is  also  sometimes 
convenient  to  normalize  a  comb  so  that  a  longest  string  of  consecutive  teeth  always 
appears  on  the  left.  For  example,  the  (5,3)-comb,  1**11.  defined  on  a  2^-long  sequence 

by 

fii^if  ^i+l  t  ^i+2f  ^i+3f  ^i+4)  ~  ^>+3}  ^i+4)> 
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is  normalized  to  the  (6,3)-comb,  specified  by 


/ii(5«5  ^i+li  ^i+2i  ^i+s)  ~  ('S»>  ^i+l?  ^«+5)» 

i-^M  i!!!l  (Each  are  2-combs).  There  also  may  be  instances  when  one 

wants  to  use  the  version  of  the  comb  that  has  the  smallest  length  /. 

If  a  sequence,  5„,  of  length  2"  has  the  property  that  some  (/,n)-comb  can 
extract  every  different  n-tuple,  then  Sn  is  called  complete  or  nonclassical  de  Bruijn. 
It  follows  that  the  reverse  and  complement  of  a  complete  sequence  are  also  complete. 
By  definition,  every  de  Bruijn  cycle  is  complete. 

B.  SEQUENCES  AND  THEIR  EXTRACTED  n-TUPLES 

To  illustrate  these  concepts,  consider  the  sequence  5  of  length  2^  given  by 
S  =  0000111100101101.  Using  the  classical  de  Bruijn  (4,4)-comb  denoted  by  the 
symbol,  C  =  M  i  and  specified  by 


/f(Si,  Si+i,  Si.j.2,  Sj.4.3)  —  (s,',  S,'.4.2y  ^t+3)) 

4-tuples  are  extracted  from  5  in  the  following  order:  fi  0000,  /2  -*■  0001, 

fs  — >  0011,  /<  — f  0111,  fa  — >  1111,  /e  — ►  1110,  fr  1100,  /s  ->  1001,  fg  — >  0010. 

fio  0101,  fix  — ^  1011,  /12  0110,  /i3  — >  1101,  /i4  — >  1010,  /15  — >  0100, 

/is  -y  1000.  The  (4,4)-comb  applied  to  S  generates  a  sequence  T{C,S),  of  4-tuples 
that  specifies  a  Hamiltonian  cycle  along  the  Good  -  de  Bruijn  graph  Figure  8 
represents  this  cycle.  We  see  that  the  sequences,  S  and  T{C,  5),  define  the  same  cycle 
around  Bn. 
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Figure  8.  A  Hamiltonian  cycle  in  the  Good  -  de  Bruijn  digraph 
The  (8,4)-comb,  C  =  specified  by 

^i+Ii  5«+2?  5i+3,  Sx+4, 5i+5?  ^i+6»  ^t+z)  =  (^ij  5|^i,  5,^2?  >*1+7)1 

generates  the  following  sequence,  T{C\S),  of  distinct  4-tuples  from  5:  (0001,  0000, 
0010,  0111,  1110,  1111,  1101,  1000,  0011,  0100,  1010,  0110,  1100,  1011,  0101,  1001). 
The  sequence  T{C',  S),  however,  does  not  represent  a  Hamiltonizm  cycle  along  the 
de  Bruijn  digraph  B4  since  there  are  consecutive  4-tuples  in  T(C\S)  that  are  not 
adjacent  in  B4.  Since  this  (8,4)-comb  has  3  consecutive  teeth  (i.e.,  it  is  a  3-comb), 
the  sequence  5  must  specify  a  walk  along  the  graph  B3  (Figure  9A)  that  visits  every 
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Figure  9.  Good  -  de  Bniijn  digraph  B3 


vertex  exactly  twice.  The  visitation  times  for  each  vertex  in  this  walk  are  shown  in 
the  parentheses  in  Figure  9B. 

Let  C  be  an  3-comb  for  the  sequence  5.  It  follows  immediately  from  de  Bruijn's 
doubling  theorem  [Ref.  13]  that  the  sequence  5,  since  it  visits  every  vertex  of  B3  twice, 
also  induces  a  double  Eulerian  cycle  along  the  digraph  B3.  Furthermore,  the  sequence 
T(C,S),  of  4-tuples  specifies  a  double  Eulerian  cycle  along  ^4_(i+i),  a  digraph  iso¬ 
morphic  to  Bi  [See  Appendix  Bj.  These  two  resp)ective  cycles,  with  visitation  times 
in  parentheses,  are  shown  in  Figure  10.  The  sequences  T{C,  S)  and  S  define  identical 
walks  along  the  isomorphic  graphs  ^4.(14.!)  and  Bj,  respectively. 
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Figure  10.  and  the  Good  -  de  Bruijn  digraph  B^ 


Lemma  III.l  Let  Sn  be  a  complete  binary  sequence  of  length  2”  with  an 
associated  (n-kj-comb.  Then  Sn  specifies  a  walk  that  traverses  each  arc  along 
the  digraph  J3n-(*+i)  exactly  2*  times  where  (k  +  l)  <  n. 

Proof;  Since  S„  is  complete,  each  (n  —  fc)-tuple  occurs  exactly  2*  times.  The 
arcs  in  the  Good  -  de  Bruijn  digraph,  Bn-(fc+i),  are  uniquely  represented  by 
(n  —  A;)-tuples.  Therefore,  Sn,  specifies  a  walk  that  traverses  each  arc  exactly 
2*  times  along  the  digraph  Bn-{k+i)-  B 


Corollary  III.2  Let  Sn  be  a  complete  binary  sequence  of  length  2"  with  an 
associated  (n-k)-comb,  C.  Then  the  sequence,  T{C,Sn),  of  n-tuples  extracted 
by  the  comb  specifies  a  walk  that  traverses  each  arc  exactly  2*  times  along  the 
digraph  Bn-{k+i)  where  (k  +  1)  <  n. 
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Proof;  Follows  immediately  from  Lemma  III.l  and  the  definition  of  the  iso¬ 


morphic  digraph  Sn-(k+i)-  • 

The  elements  of  the  set  Wjn  for  Bn-ik+i)  define  2"-long  walks  along  the  arcs  of 
the  graph  B„_(*+i)  where  each  arc  is  visited  exactly  2*  times.  The  set  W2*  includes 
all  the  sequences  having  an  (n  -  fc)-comb.  The  following  theorem  of  van  Aardenne- 
Ehrenfest  and  de  Bruijn  [Ref.  12]  allows  us  to  count,  in  any  Eulerian  regular  directed 
graph  D,  the  number  of  sequences  that  define  a  walk  along  D  where  eatch  arc  is 
traversed  exactly  p  times.  Table  II  provides  the  number  of  sequences  defining  an 
Eulerian  or  double  Eulerian  cycle  along  Bn,  for  1  <  u  <  4. 


Theorem  III.3  (van  Aardenne-Ehrenfest  and  de  Bruijn  )  Let  D  =  {V,  A) 
be  an  Eulerian  regular  directed  graph  where  in(v)  s=  a.  Then  the  number  of 
ways,  ||D||p,  to  traverse  each  arc  in  D  exactly  p  times  is 


(111.1) 


where  <t>  is  the  Euler’s  totient  function,  ||/?||i  ts  the  number  of  Eulerian  cycles 
in  D,  and  the  summation  is  extended  over  all  divisors  of  p. 


Theorem  III.3  was  proved  using  an  interesting  but  considerably  complicated 
argument.  The  development  and  proof  of  the  theorem  is  summarized  in  Appendix  A. 

Let  5  be  a  binary  sequence  of  length  2"  with  am  (n—  l)-comb.  By  Lemma  III.l, 
5  is  am  element  in  the  set  of  sequences,  Wj,  that  specify  a  double  Eulerian  cycle  along 
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n 

Eulerian 

Double  Eulerian 

1 

1 

5 

2 

2 

82 

3 

16 

52,496 

4 

2048 

44,079,843,328 

Table  II.  The  number  of  Eulerian  and  double  Eulerian  cycles  along  Bn,  1  <  n  <  4 

Bn-2-  From  Theorem  III.3,  the  number  of  distinct  double  Eulerian  cycles  along  Bn-2 
is  given  by 

||Bn-llU  =  (1  +  3»— (111.2) 

Not  every  sequence,  however,  in  Wj  has  an  (n  —  l)-comb.  Finding  the  general  formula 
for  the  number  of  sequences  with  an  (n  —  l)-comb  has  been  elusive.  In  the  follow¬ 
ing  two  sections  we  show  that  the  number  of  sequences  of  length  2’*  possessing  an 
(n  —  l)-comb  is  much  smaller  than  the  number  of  sequences  with  an  n-comb. 

C.  THE  RUN  STRUCTURE  OP  SEQUENCES  POS¬ 
SESSING  (n-l)-COMBS 

It  is  well-known  that  the  number  of  de  Bruijn  cycles  of  length  2”  is  2^"”*"”. 
Therefore,  of  all  of  the  double  Eulerian  cycles  along  Bn-2,  there  are  2^"”*""  that  are 
defined  by  de  Bruijn  cycles.  In  this  section  we  show  that,  statistically,  we  should 
expect  the  number  of  2"-long  sequences  with  an  (n  —  l)-comb  to  be  only  2^" 

A  run  of  ZEROs  in  a  binary  sequence  is  defined  to  be  a  subsequence  of  con¬ 
secutive  O’s  that  is  preceeded  and  followed  by  a  1.  The  length  of  a  run  of  ZEROs 
is  the  number  of  consecutive  O’s  in  the  subsequence.  A  run  of  ONEs  is  similary  de- 
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fined.  A  binary  sequence  can  be  interpreted  as  a  sequence  of  integers  representing  the 
lengths  of  the  alternating  runs  of  ZEROs  and  ONEs  in  the  sequence.  For  example, 
the  sequence: 

5  =  00001010110111010011001000011111 

corresponds  to  the  run  sequence  R  =  (4111121311222145).  The  sequence,  R,  consists 
of  a  subsequence  Zs  =  (41111224)  of  the  run  lengths  of  runs  of  ZEROs  interleaved  (or 
perfectly  shuffled  [Ref.  21])  with  a  subsequence  O5  =  (11231215)  of  the  run  lengths 
of  runs  of  ONEs. 

The  multiset  consisting  of  the  lengths  of  the  runs  of  ZEROs  and  ONEs  in  a 
de  Bruijn  cycle  is  completely  determined,  (See  Chapter  V  or  [Ref.  6]),  In  fact,  a 
run  sequence  that  defines  a  cycle  in  in  Bn  can  also  be  determined.  For  example, 
when  5  is  a  sequence  that  defines  a  double  Eulerizm  cycle  along  B„,  the  multiset  of 
the  lengths  of  the  runs  of  ZEROs  or  ONEs  in  S  consists  of  one  of  only  two  possible 
multisets,  respectively. 

Theorem  III.4  Let  S  be  a  sequence  defining  a  double  Eulerian  cycle  along 

Bn-  Then  S  satisfies  one  of  the  following  2  criteria: 

1.  S  has  2”“*  runs  of  ZEROs  of  length  k  for  I  <  k  <  n  and  a  single  run  of 
ZEROs  of  length  n  +  2,  or 

2.  S  has  2”“*  runs  of  ZEROs  of  length  k  for  1  <  fc  <  n  —  1  and  two  runs  of 
ZEROs  of  length  n  +  1  • 

The  same  distribution  holds  for  the  runs  of  ONEs. 


Proof:  Since  each  (n  +  l)-tuple  occurs  exactly  twice  in  5,  the  longest  runs  of 
ZEROs  must  consist  of  either: 

1.  Exactly  1  run  of  ZEROs  of  length  n  +  2,  or 

2.  Exactly  2  runs  of  ZEROs  of  length  n  +  1. 

Case  1:  The  run  of  ZEROs  of  length  n  +  2  must  be  preceded  and  followed  by 
a  1,  or  the  (n  +  l)-tuple  (00  •  •  •  00)  would  appear  at  least  three  times  in  5.  The 
(n  +  l)-tuple  consisting  of  a  1  followed  by  the  n-tuple  (00  . .  .0)  adso  occurs 
exMtly  twice  in  the  sequence.  One  occurrence,  however,  is  already  accounted 
for  by  the  run  of  ZEROs  of  length  n  +  2.  Thus,  there  is  an  additioncil  run 
of  ZEROs  of  length  n  that  provides  the  second  (n  +  l)-tuple  consisting  of  a 
1  followed  by  the  n-tuple  00  ...  0.  Thus,  there  is  no  run  of  ZEROs  of  length 
n  4- 1.  To  find  the  number  of  runs  of  ZEROs  of  length  fc,  for  1  <  fc  <  n  —  1,  we 
consider  all  n  + 1  consecutive  bits  of  the  sequence  that  begin  with  a  1  followed 
by  the  A;-tuple  00  ...  0  and  then  a  1.  E2M:h  such  run  can  be  made  to  correspond 
to  an  arbitrary  (n  -f  l)-tuple  of  the  form 

IQO ..  .Qlxx..  .z, 

k  n—k—l 

where  the  z’s  axe  chosen  as  arbitrary  bits.  Since  we  are  free  to  choose  each  of 
the  remaining  n  —  k—l  bits,  there  are  x  2  runs  of  ZEROs  of  length  k 

for  1  <  k  <  n  —  1.  With  the  single  run  of  ZEROs  of  length  n  and  the  single 
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n 

1 

2 

3 

4 


Eligible  Run  Distributions  for  B„ 

{3,1} 

{2,2} 

{4,2,1,!} 

{3,3,1,!} 

{5, 3, 2, 2, 1,1,1,!} 

{4,4,2, 2,1, 1,1,1} 

{6,4,3,3,2,2,2,2,1,1,1,1,1,1,1, 

{5,5,3,3,2,2,2,2,1,1,1,1,1,1,1, 


1} 

1} 


Table  III.  Distribution  of  runs  for  double  Eulerian  cycles  along  B„ 


run  of  ZEROs  of  length  n  +  2  the  result  follows.  The  same  argument  holds  for 
runs  of  ONEs. 

Case  2:  The  two  runs  of  ZEROs  of  length  n  +  1  must  each  be  preceded  and 
followed  by  a  1,  or  the  (n  +  l)-tuple,  00'--00,  would  appear  at  least  three 
times  in  S.  The  (n  +  l)-tuple  consisting  of  a  1  followed  by  the  n-tuple  00  ...  0 
occurs  exactly  twice  in  the  sequence.  These,  however,  are  already  accounted 
for  by  the  two  runs  of  ZEROs  of  length  n  + 1.  Thus,  there  is  no  run  of  ZEROs 
of  length  n.  In  a  like  manner  as  above,  there  are  x  2  runs  of  ZEROs  of 

length  fc,  for  1  <  fc  <  n  —  1.  With  the  two  runs  of  ZEROs  of  length  n  +  1  the 
result  follows.  The  same  argument  holds  for  the  run  distribution  of  ONEs.  ■ 


Every  double  Eulerian  cycle  around  Bn  can  be  described  by  a  binary  sequence 
whose  run  sequence,  R,  consists  of  a  subsequence  Zn  interleaved  with  a  subsequence 
On-  The  subsequences  Zn  and  On  are  respectively  a  permutation  of  one  of  the  two 
multisets  in  Theorem  III.4.  To  determine  how  many  sequences  have  (n  —  l)-combs. 
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it  is  useful  to  partition  all  of  the  double  Euleriaja  cycles  into  one  of  the  following  3 
sets:  'P(n,2),  ^(n.i),  and  P(„)  defined  as: 

1.  The  set,  'P(n,2)i  consists  of  sequences  where  both  the  ZERO  and  ONE  run 
sequences  are  permutations  of  the  multiset  that  includes  the  element  n  +  2. 

2.  The  set,  'P(n,i)>  consists  of  sequences  where  both  the  ZERO  and  ONE  run 
sequences  are  permutations  of  the  multiset  that  includes  the  element  n  +  1 . 

3.  The  set,  'P(n)j  consists  of  sequences  where  the  ZERO  and  ONE  run  sequences 
are  permutations  of  different  multisets. 

The  sizes  of  each  of  the  3  sets,  ‘P(n,2)>  ^(n.i),  and  'P(„),  are  determined  in  a 
straightforward  manner.  Equation  III.2  provides  the  total  number  of  double  Eulerian 
cycles,  ||B„||2,  where  ||R„||2  =  11^(n.2)l  +  !^(n.i)|  +  l'P(n)l-  Let  denote  the  Eulerian 
digraph  constructed  from  the  Good  -  de  Bruijn  digraph  by  removing  the  two  arcs 
(loops),  (0)  and  (1 ).  One  can  see  that  |^(n,i)|  is  equivalent  to  the  number  of  sequences 
representing  double  Eulerian  cycles  along  B„,  i.e.,  l'P(n,i)|  =  ll^^nlb-  From  Equation 
A. 5  in  Appendix  A  we  find 

l^(n.i)l  =  1(2(2"-"-') +2-(2"-^‘-2)2(2"-'‘-i)12(*"-=»)22) 

_  2(2"-n-2)  ^  2(2"-n-2)3(2’’-2)  (III.3) 

=  2(2"-'‘-2)(1  +  3(2"-2))^ 

Now  let  Bn  and  Bn  denote  Eulerian  digraphs  constructed  from  the 
Good  -  de  Bruijn  digraph  by  removing  just  the  loop,  (0)  and  (1),  respectively.  We 
define  Q  to  be  the  set  of  sequences  representing  double  Eulerian  cycles  along  B„.  By 
inserting  a  0  into  the  longest  run  of  ZEROs  of  a  sequence  Q  €  Q,  a  sequence  is  created 
that  contains  a  run  distribution  of  ZEROs  that  includes  the  element  (n  +  1)  tw'ce 
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and  a  run  distribution  of  ONEs  from  either  of  the  two  eligible  multisets.  In  a  similar 
manner,  each  sequence  representing  double  Eulerian  cycles  along  is  equivalent  to  a 
sequence  with  a  run  distribution  of  ONEs  that  includes  the  element  (n  +  2)  and  a  run 
distribution  of  ZEROs  from  either  of  the  two  eligible  multisets.  We  define  %  to  be  set 
of  sequences  representing  the  double  Eulerian  cycles  along  B„.  The  set  Q\J'R  =  U 
is  equivalent  to  the  set  of  double  Eulerian  cycles  that  includes  all  the  elements  of 
■Pjn).  In  addition  to  the  elements  of  P(n)»  the  set  U  also  includes  the  sequences  that 
have  ZERO  and  ONE  run  sequences  appeairing  zis  permutations  of  the  multiset  that 
includes  the  element  n  +  1,  i.e.,  'P(n.i)  U  V(n)  =  U.  It  follows  immediately  that 

\n  =  (llSnIU  -  |J’(n..)l)  X  2, 

since  H^nlb  =  ll^nlU-  By  again  applying  Equation  A.5  in  Appendix  A,  we  find  that 
l^(n)l  =  (^2(2"-"-!)  +  -  2<2"-»-2)(1  -(.  3<2"-2)))  x  2 

=  2(2’’-’»)3(2"-2). 

(111.4) 

Consequently,  it  follows  that 

I^K2)|  =  \\Bnh-\V^r.,l)\-\V(r.)\ 

(111.5) 

_  2l2"-n)3(2"-2) 

Table  IV  provides  the  values  for  ||B„||2,  |P(„,2)|,  |^(n,i)|,  and  \V(n)\,  for  1  <  n  <  4. 
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n 

ll^n||2 

l^(n.2ll 

..  I^(n|l 

1 

5 

2 

1 

2 

2 

82 

36 

10 

36 

3 

52,496 

23,328 

5,840 

23,328 

4 

44,079,843,328 

19,591,041,024 

4,897, 761,280 

19,591,041,024 

Table  IV.  Double  Eulerian  cycles  along  Bn,  by  category 


D.  RANDOMNESS  OF  THE  DOUBLE  EULERIAN  WALKS 

In  this  section  we  show  that  the  number  of  classical  de  Bruijn  cycles  in  the 
set  7^(n,2)  coincides  with  the  expected  number  of  de  Bruijn  cycles  that  would  exist 
if  the  sequences  in  P(n,2)  had  a  particular  randomness  property.  (Recall  that  of  the 

2(2»-n)3(2--2) 

sequences  in  ^(n,2)>  exactly  are  de  Bruijn,  since  the  sequence 

length  corresponds  to  a  Hamiltonian  path  through  Hn+2  ) 

A  Randomness  Property:  From  any  vertex,  each  departing  aurc  is  equally  likely 
to  be  traversed  next  in  a  walk  along  Bn  defined  by  the  2”'*'*-long  sequence,  Sn+2, 
where  Sn+2  €  V{n,2)- 

Each  vertex  in  is  entered  4  times  along  the  walk  defined  by  the  sequence 
S„+2.  Let  the  arcs  in  B„  be  labeled  with  O’s  and  I’s  as  described  in  Chapter  II. 
Suppose  vertex  v  is  entered  from  the  arc  labeled  ai  for  the  first  time.  Let  Xi  be  a 
binary  indicator  variable  where  =  1  if  and  only  if  the  next  arc  traversed  from  v 
is  labeled  1.  The  indicator  variable  =  0  if  and  only  if  the  next  arc  traversed  from 
V  is  labeled  0.  Similarly,  when  a  vertex  v  is  entered  from  the  arc  labeled  ai  for  the 
second  time,  let  X2  be  a  binary  indicator  variable  where  X2  =  1  if  and  only  if  the 
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next  arc  traversed  from  v  is  labeled  1.  The  indicator  variable  X2  =  0  if  and  only  if 
the  next  arc  traversed  from  v  is  labeled  0. 

The  joint  probability  function  for  the  two  discrete  random  variables  Xi  and 
X3  is  given  by 

Px,.xj(xi,x2)  =  P(Xi  =  xi,X2  =  X2),  for  Xi,X2  €  {0, 1}. 

If  ^  X2  for  each  vertex  in  Bn,  the  sequence  5n+2  is  de  Bruijn  of  length  2"^^,  since 
each  (n  -|-  2)-tuple  is  distinct.  The  probability  that  Xi  ^  X2  for  each  vertex  in  B„  for 
the  walk  5„+2  can  be  computed  quite  easily. 

Let  t;  be  a  vertex  that  is  not  incident  with  a  loop  in  Bn.  Since  Sn+2  defines  a 
walk  where  each  outgoing  arc  from  t;  is  equally  likely  to  be  visited  next  and  each  arc 
in  Bn  is  visited  exactly  two  times,  it  follows  that 

P(X,  =  1|X,  =  0)  = 

where  the  bar  within  the  parentheses  indicates  conditional  probability.  Furthermore, 
if  u  is  a  vertex  incident  with  a  loop  then, 

P(X2  =  1|^,  =  0)  =  1, 

since  each  arc  in  Bn  must  be  visited  twice  and  we  must  visit  the  loop  at  this  time  or 

/  \  2^—2 

miss  it  entirely.  Therefore,  the  probability  that  S„+2  is  de  Bruijn  is  p  =  ( 

If  the  sequences  in  ‘P(n,2)  have  the  property  that  they  each  define  a  walk  by 
which  subsequent  arcs  from  each  vertex  are  equally  likely  to  be  traversed,  we  would 
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expect 

(9\2"-2  /9x2"-2 

I)  =2-"-"r-»x(|)  =2'-'— ' 

sequences  in  ‘P{n,2)  to  possess  the  de  Bruijn  property.  Since  there  are  exactly 
de  Bruijn  cycles  in  'P(n,2)i  this  expectation  is  realized.  In  the  next  section,  we  discuss 
the  implication  of  this  randomness  property  with  respect  to  the  expected  number  of 
complete  cycles  associated  with  an  (n  —  l)-comb. 

E.  THE  NUMBER  OF  COMPLETE  CYCLES 

Counting  the  exact  number  of  sequences  associated  with  sji  (n  —  A:)-comb, 
1  <  A:  <  2"“^  —  1,  remains  a  difficult  problem.  We  can,  however,  estimate  the  size 
of  the  set  of  complete  cycles  associated  with  a  particular  comb.  One  could  postulate 
that  there  are  equal  numbers  of  sequences  for  each  possible  comb,  especially  since  the 
previous  section  suggests  that  the  successive  bits  (0  and  1)  are  equally  likely  among 
the  set  of  eligible  sequences.  It  has  been  found  by  a  computer  sezurch  that  sequences 
with  n-combs  tend  to  be  much  more  common  than  sequences  with  (n  —  fc)-combs,  for 
A:  >  1.  This  result  is  supported  by  the  same  probabilistic  aurgument  developed  in  the 
previous  section. 

For  simplicity,  we  again  utilize  the  sequence  Sn+2  defined  in  Section  D.  Here  we 

n— I 

find  the  probability  that  Sn+2  supports  an  (n  —  l)-comb  of  the  type  II  "  '  II  *L  Suppose 
that  vertex  v  is  entered  from  the  arc  labeled  qi  for  the  first  time.  Then,  the  indicator 
variables  Xi  and  X2  are  defined  as  before,  except  that  now  we  are  concerned  with  the 
second  arc  traversed  from  v.  Therefore,  we  j^^n  let  Xi  be  a  binary  indicator  variable 
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where  =  1  if  and  only  if  the  second  arc  traversed  from  t;  is  labeled  1.  The  indicator 
variable  Xi  =  0  if  and  only  if  the  second  arc  traversed  from  v  is  labeled  0.  Similarly, 
when  vertex  v  is  entered  from  the  arc  labeled  qi  for  the  second  time,  let  Xj  be  a 
binary  indicator  variable  where  Xj  =  1  if  zmd  only  if  the  second  arc  traversed  from 
V  is  labeled  1.  The  indicator  variable  Xj  =  0  if  and  only  if  the  second  arc  traversed 
from  V  is  labeled  0. 

In  contrast  to  the  situation  described  in  Section  D,  the  inequality  of  Xi  aind  Xj 
for  all  vertices  in  B„  is  not  a  sufficient  condition  for  Sn+2  to  support  an  (n  —  l)-comb. 
It  must  also  be  the  case  that  Yi  ^  Vs,  where  Yi  and  V}  similarly  defined  on  the 
other  2J-C,  Qj,  entering  the  vertex  v.  Therefore,  when  vertex  v  is  entered  from  the  arc 
labeled  qj  for  the  first  time,  let  Yi  be  a  binary  indicator  variable  where  Ki  =  1  if  and 
only  if  the  second  arc  traversed  from  v  is  labeled  1.  The  indicator  variable  >1=0 
if  and  only  if  the  second  axe  traversed  from  u  is  labeled  0.  Similarly,  when  vertex  v 
is  entered  from  the  arc  labeled  aj  for  the  second  time,  let  >2  be  a  binary  indicator 
variable  where  Vj  =  1  if  and  only  if  the  second  arc  traversed  from  v  is  labeled  1.  The 
indicator  variable  >2  =  0  if  and  only  if  the  second  arc  traversed  from  v  is  labeled  0. 

The  joint  probability  function  for  the  4  discrete  random  variables  Xi,  X2,  VI, 
and  V2  is  given  by 

PXi,X2,Yi,Y,{^uX2,yi,y2)  =  PiXi  =  Xi,X2  =  X2,Yi  =  yi,Y2  =  ^2), 
for  Xi,X2,yi,y2  €  {0,1}.  If  Xi  ^  X2  and  Vi  ^  Y2  for  all  the  vertices  in  B„,  the 

n— 1 

sequence  has  a  comb  specified  by  II " '  II  *L  The  probability  that  Xi  ^  X2  for  each 
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vertex  in  Bn  not  incident  with  a  loop  is  |.  Given  that  Xi  ^  X2,  the  probability  that 
Vi  Vj  is  Therefore  the  probability  of  Xi  ^  Xj  and  Ki  ^  Vj  is  «  5. 
Furthermore,  if  u  is  a  vertex  incident  with  a  loop  then, 


P(X2  =  =  0)  =  1, 


since  each  arc  must  be  visited  twice. 

n— I 

II  ‘  “  11*1  comb  is  approximately  (5) 


Therefore,  the  probability  that  5'„+2 
We  would  expect  to  find  that 


has  a 


/1n2"-2  /1\2"-2 

X  (i)  =  X  (i)  =  2^’-" 


of  the  sequences  in  'P(n,2)  l^ave  these  combs.  For  n  =  3,  of  the  23,328  sequences  in  the 
set  Vy  there  are  exactly  2®  combs  of  the  type  supporting  our  claim.  In  general,  the 
experimental  results  support  the  probabilistic  predictions.  We  have  found  that  the 
sizes  of  the  set  of  2’‘-long  sequences  with  (n  —  l)-combs,  of  all  types,  is  substantially 
smaller  than  2^"”'”",  the  number  of  de  Bruijn  sequences  of  length  2". 

In  the  following  chapters  we  develop  the  necessary  theoretical  concepts  on 
double  Eulerian  cycles  to  analyze  the  Good  -  de  Bruijn  digraphs  and  to  gain  a  better 
understanding  of  complete  cycles.  A  complete  combinatorial  explanation  for  the 
number  of  sequences  satisfying  a  particular  comb  is  an  intriguing  goal  that  seems 
very  hard  to  achieve  at  present. 
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IV.  A  MEASURE  ON  A  GRAPH 

“Would  you  tell  me,  please,  which  way  I  ought  to  go  from  here?” 

“That  depends  a  good  deal  on  where  you  want  to  get  to,  ”  said  the  cat. 

“  I  don’t  much  care  where  ...”  said  Alice 

“Then  it  doesn't  matter  which  way  you  go,  ”  said  the  cat. 

Alice’s  adventures  in  Wonderland,  Chapter  7 

A.  INTRODUCTION 

In  this  chapter  we  develop  the  theoretical  concepts  regarding  double  Eulerian 
cycles,  i.e.,  walks  on  a  connected  graph  or  weak  digraph  that  visit  every  edge  exactly 
twice.  A  measure  can  be  given  to  the  visitation  pattern  of  a  double  Eulerian  cycle. 

Every  de  Bruijn  cycle  (and  some  nonclassical  de  Bruijn  cycles)  of  length  2” 
defines  a  walk  that  visits  every  vertex  on  B„,  traverses  each  arc  of  Bn-i,  *uid  passes 
through  each  arc  on  Bn^2  exactly  twice.  From  these  latter  walks  emerges  a  measure 
of  complexity  of  the  sequence  in  question  and  a  notion  of  the  value  for  the  underlying 
graph. 

B.  THE  VALUE  OF  A  GRAPH 

We  define  W2  to  be  the  set  of  walks  that  traverses  each  edge  (arc)  in  the  graph 
(digraph)  exactly  twice.  The  set  W2  is  called  the  set  of  double  Eulerian  cycles.  We 
define  a  measure  on  a  walk  in  W2  as  the  sum,  over  all  edges  of  the  graph,  of  the 
positive  difference  of  the  visitation  times  on  each  edge.  The  value  of  a  graph  G  is 
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Figure  11.  Double  Eulerian  cycles  with  visitation  times 


then  defined  to  be  the  minimum  measure  over  all  double  Eulerian  walks  on  G.  In 
addition,  every  connected  graph  always  has  a  value. 

In  Figure  11,  two  different  double  Eulerian  walks  are  presented.  The  numbers 
in  parentheses  are  the  visitation  times.  Walk  A  =  (ui,  t>2>  ua,  U4,  U5,  V2,  U3,  t;^,  1)5,  V2,  uj) 
has  a  measure  of  25.  Walk  B  =  (u2,t;i,V2,t;5,V4,U5,uj,U3,  V4,t;3,  V2)  has  a  measure  of 
9,  which  can  be  shown  to  be  the  value  of  the  graph. 


Theorem  IV.l  Let  G  be  a  connected  graph.  Then  G  has  at  least  one  double 
Eulerian  walk. 

Proof:  Let  H  be  the  graph  generated  by  duplicating  each  edge  in  G.  The 
degree  of  each  vertex  in  H  is  then  even.  Therefore,  Theorem  II.  1  implies  that 
H  has  a  closed  Eulerian  trail  and  G  must  have  a  double  Eulerian  cycle.  As  an 
immediate  corollary,  a  double  Eulerian  cycle  in  G  yields  an  Eulerian  walk  in 
H.  m 


45 


Corollary  rV.2  Every  double  Eulerian  cycle  in  a  grapi.  G  is  a  closed  walk. 

Proof:  Let  H  be  the  graph  determined  by  duplicating  each  edge  in  G.  From 
Theorem  IV.  1,  a  walk  in  H  is  Eulerian  if  and  only  if  the  walk  is  double 
Eulerian  in  G.  The  degree  of  each  vertex  in  H  can  be  counted  by  following 
an  Eulerian  walk  Wi  in  H.  Each  occurrence  of  a  vertex  along  Wi  adds  2  to 
the  degree  of  an  interior  vertex  in  Wi  and  1  to  the  degree  of  the  initial  and 
terminal  vertices  (recall  that  a  walk  is  a  sequence  of  vertices  and  edges).  Since 
the  degree  of  each  vertex  in  H  is  even,  the  initial  and  terminal  vertices  of  the 
walk  must  be  the  same  vertex.  Therefore,  every  double  Eulerian  cycle  in  G  is 
closed.  ■ 

Since  a  double  Eulerian  cycle  is  closed,  the  edge  sequence  representing  a  double 
Eulerian  cycle  can  be  viewed  as  a  cycle.  Alternatively,  we  can  also  define  a  measure  on 
a  walk  in  W]  as  the  sum  over  all  edges  of  the  distance  along  the  cycle  between  the  two 
occurrences  of  each  edge.  From  this  perspective,  the  cycle  distance  between  identical 
edges  in  VV2  remains  constant  for  each  cyclic  shift  of  the  edge  sequence  representing 
the  double  Eulerian  cycle.  In  Figure  12,  A  =  (65,61,63, 62,63, 63, 61,65,  £4, 64)  is  a 
double  Eulerian  walk.  As  before,  visitation  times  are  in  parentheses.  The  distance 
between  the  first  appearance  at  each  edge  ei,  63, 63, 64, 65  and  the  second  appearance 
in  walk  A  are  5,1, 1,1,3,  respectively.  Walk  B  =  (61, 63, 63, 63, 63,  ci,  65, 64, 64, 65)  is  the 
sequence  generated  by  cyclically  shifting  walk  A  by  one.  The  distances  between  the 
two  occurrences  of  each  edge  in  walk  B  are  unchanged. 
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Figure  12.  Double  Eulerian  cycles  with  visitation  times  (cycled) 


In  a  graph  G  with  size  q,  the  sequence  of  edges  /i,  /a,  •  •  • ,  f2<i  in  a  double  Eule¬ 
rian  cycle  completely  determines  the  measure  of  the  walk.  More  precisely,  the  measure 
of  a  walk  is  ultimately  determined  by  the  number  of  edges  previously  traversed  ex¬ 
actly  once  (called  single-edges  or  <T-edges)  along  the  double  Eulerian  cycle.  When 
an  edge  e  has  been  traversed  twice  (or  not  at  all)  during  the  walk,  subsequent  edges 
along  the  walk  can  neither  increase  nor  decrease  the  difference  of  the  two  visitation 
times  for  that  edge.  When  an  edge  e',  however,  has  been  traversed  exactly  once  during 
the  waik,  each  subsequent  edge  traversed  along  the  walk  increases  the  difference  of  the 
visitation  time  for  e'  by  one.  Thus,  it  is  the  number  of  current  cr-edges  at  each  step 
edong  the  walk  that  determines  the  measure  of  the  double  Eulerian  cycle.  Finally, 
as  stated  before,  the  aggregate  sum  of  the  <7-edges  at  eau:h  vertex  along  the  double 
Eulerian  cycle,  is  the  measure  of  the  walk.  From  this  perspective,  the  measure  of  a 
double  Eulerian  cycle  can  be  calculated  as  follows: 


1.  Transform  the  sequence  of  edges  in  a  double  Eulerian  cycle  into  a 

sequence,  {an}nLi  of  I’s  and  -I’s  as  follows: 


4 


Edge  {x,y}  = 


il  if  this  is  the  first  occurrence  of  the  edge  {x,  y} 
on  the  double  Eulerian  cycle 
—1  otherwise. 


29 

2.  Let  {<n}nLi  be  the  sequence  of  partial  sums  of  the  series  ^  a„. 

n=l 

2q 

3.  The  .measure  of  the  walk  is  ^  tj. 

k=l 


For  example,  consider  walk  A  in  Figure  11.  We  find  the  following: 


1-  K}n=l  =  {1,1, 1,1, 1,-1, -1,-1, -1,-1}. 

2.  =  {1,2,3, 4,5,4, 3,2, 1,0}. 

29 

3.  The  measure  of  walk  A  is  ij  =  25. 

fc=i 

It  is  interesting  to  note  that  set  of  possible  sequences  {an}n=i  1’*  -I’s 
for  a  graph  G  defined  in  this  way  is  a  subset  of  all  of  the  (north/east)  routes  between 
opposite  comers  on  a  q  x  q  lattice  that  are  on  or  below  the  diagonal,  where  q  is  the 
si2:e  of  G,  It  is  well-known  [Ref.  22]  that  the  total  number  of  subdiagonal  routes  is 

where  Cn  is  the  nth  Catalan  number. 

In  the  remainder  of  this  section  we  show  that  the  value  of  a  graph  G  can  be 
determined  as  a  function  of  the  edge-status,  es„,  of  a  vertex  v  where  v  €  EM{G)  and 
q,  the  size  of  G.  First  we  develop  some  results  relating  the  concepts  of  status  and 
edge-status. 
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Theorem  FV.S  Let  T  =  (V,  £)  he  a  tree.  Then  es^,  =  —  q  where  es^  is  the 

edge-status  of  v,  Sv  is  the  status  of  v,  and  q  is  the  size  of  T. 

Proof;  If  a  tree  T  has  one  edge,  it  follows  immediately  that  es^  =  s^  —  q. 
Assume  T  has  more  than  one  edge.  Let  v  be  a  root  of  T  that  gives  T  a  height 
of  h.  Assume  ei  and  62  are  two  edges  such  that  de(v,ei)  =  de(t;,  62)  =  k  and 
where  Ci  and  62  are  incident  with  the  same  vertex  V3  with  d{v,  V3)  =  A:-|-l.  Then 
(u, . . . ,  Cl,  U3)  and  (u, . . . ,  62,  V3)  are  two  different  waJks  from  v  to  the  vertex 
V3.  This  infers  a  cycle  in  T.  Therefore,  each  edge  e  £  E,  where  de{v,e)  =  i, 
0<t</i  —  lis  incident  with  a  unique  vertex  n  eV  —  v  where  d{v,  n)  =  i  +  1. 
Since  |£|  =  |V|  —  1,  it  follows  that 

es„  =  5^de(t;,e)  =  d(v,e)  -  1  =  Sv  -  9. 

e€E  ngV— V 


The  following  corollary  follows  immediately  from  Theorem  IV.3. 

Corollary  rV.4  LetT  be  a  tree,  with  edge-median  EM{T)  and  median  M{T). 
Then  EMiT)  =  M{T). 

Unfortunately,  if  a  graph  G  is  not  a  tree,  it  is  not  necessarily  true  that 
EM(G)  =  M{G).  Graph  G  in  Figure  13  has  EM{G)  —  {ue}  Jmd  M{G)  =  {vs}. 
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G; 

Figure  13.  Example  for  which  EM(G)  ^  M{G) 

The  next  result  provides  a  lower  bound  for  the  measure  of  a  waJk  in  1^2,  where 
V  €  V  on  a  connected  graph  G  =  ( V,  E). 

Lemma  IV.5  Let  G  =  (V,  jB)  be  a  connected  graph  of  size  q.  Let  v  €V,  with 
edge-status  es„.  If  m  is  the  minimum  measure  of  a  double  Eulerian  cycle  VV2, 
beginning  at  v,  then  m  >  2(e5v)  +  g. 

Proof:  Let  v  be  the  initial  vertex  of  a  double  Eulerian  cycle  in  G.  Let 

maxde(v,e)  =  n.  Since  a  double  Eulerian  cycle  is  closed,  all  of  the  mem- 
€^E 

bers  of  the  set  of  edges  of  edge-distance  j, '  <  j  <  n,  from  v  must  be  visited 
twice  before  aU  edges  of  edge-distance  j  —  1  are  visited  twice.  It  follows  that 
there  are  at  least  j  <r-edges  when  any  edge  Cj  in  the  set  of  edges  of  distance 
j  from  V  is  visited  for  the  first  time  in  a  double  Eulerian  wcdk.  Furthermore, 
there  are  at  least  j  -f  1  <r-edges  when  ej  is  traversed  the  second  time  in  the 
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walk  (since  Cj  is  now  included  in  the  set  of  tr-edges).  Therefore,  each  edge  Cj 
at  edge-distance  j  from  v  increases  the  measure  of  a  walk  by  at  least  2j  + 1.  It 
follows  that  if  m  is  the  minimum  measure  of  a  double  Eulerian  walk  beginning 
at  V,  then 

m  >  V  (2  (de(v,  e))  +  1)  =  2(csv)  +  q. 
e€£ 


We  now  describe  an  algorithm  (Algorithm  A)  to  construct  a  double  Eulerian 
cycle  of  minimum  measure  on  a  connected  graph.  The  algorithm  includes  a  subroutine 
Cyclel  that  is  recursively  called  throughout  the  algorithm.  Informally,  Algorithm  A 
is  very  similar  to  the  depth-first  search  algorithm  [Ref.  23].  The  algorithm  proceeds 
from  the  initial  vertex  v  in  a  forward  direction  (adding  new  edges)  for  as  long  as  this 
is  possible.  When  it  is  no  longer  possible  to  advance  (add  a  new  edge),  the  algorithm 
backtracks  to  the  first  vertex  from  which  it  is  then  possible  to  go  forward  revisiting 
edges  as  it  goes.  The  algorithm  proceeds  until  each  edge  is  visited  exactly  two  times. 
Each  new  edge  added  by  the  2dgorithm  has  the  following  properties: 

1.  The  new  edge  is  not  in  the  current  walk, 

2.  The  new  edge  is  incident  with  the  last  visited  vertex  in  the  walk,  and 

3.  The  edge-distance  from  v  to  the  new  edge  is  exactly  one  greater  than  the 
edge-distance  from  v  to  the  last  visited  <T-edge  in  the  walk. 
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It  follows  from  the  above  that  when  no  new  edge  can  be  added  to  the  walk,  the  next 
edge  traversed  is  the  last  visited  <T-edge  in  the  walk  (i.e.,  the  a-edge  of  greatest  edge- 
distance  from  the  initial  vertex  u).  This  is  in  essence  the  backtracking  aspect  of  the 
algorithm. 

ALGORITHM  A  [Constructs  a  double  Eulerian  cycle  on  a  connected  graph 
G  =  (V,E)  beginning  at  vertex  i/  €  V] 

Input:  A  graph  G  =  (V,  E)  of  size  q  and  a  vertex  i/  as  globed  parameters. 

Output:  Array  P,  an  ordered  list  of  edges  constituting  a  double  Eulerian  cycle 
through  G  beginning  at  the  given  vertex  t/. 

Parameters  N  =  [ni, . . . ,n2,+i],  P  =  ^  global 

arrays  of  vertices,  edges,  and  edges  of  size  2q  +  l,  2q,  and  q,  respectively. 

(Note:  The  parameter  d  is  an  integer  that  denotes  the  edge-distance  of  the  last  visited 
<T-edge  from  vertex  1/  in  an  ongoing  walk.  When  there  are  no  <7-edges  in  the  walk,  d  is 
set  to  -1.  The  parzuneter  Uj  indicates  the  current  vertex  in  the  walk.  The  parameters 
i  and  j  are  integer  counters  indicating  the  number  of  edges  currently  traversed  and 
the  number  of  current  <7-edges  along  the  walk,  respectively.) 
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Figure  14.  A  graph  to  illustrate  Algorithm  A 


Algorithm  A 


Initially  set  j  :=  1;  i  :=  1;  d  :=  —1;  Ui  :=  v;  rik  :=  empty;  2  <  A:  <  2q  +  1; 
Pi  :=  empty,  I  <l  <2q;  and  hm  ■=  empty;  1  <  m  <  q. 


Cyclel(iV,  P,  H,j,  i,  d);  end. 

Subroutine  Cyclel  (JV,  P,  H,j,  i,  d) 

if  de(n,,  c)  =  0  and  de(ni,  e)  =  d  +  1  and  de{i/,  e)  =  d(i/, n,)  and  e  ^  P,  for  e  6  £■ 
then  n,+i  :=  x  where  e  =  (n,,  x) 

Pi  :=  € 
hj  :=  e 
d  ;=d+l 
i  •.=  1  +  1 
j  :=  i  +  1 
Cyclel(Ar,P,//,j,i,d) 
else  Pi  :=  hj 

Wt+i  ;=  ni_i 

d  :=d-l 
i  :=  i  +  1 
j  :=  j  -I 
if  i  =  2q 
then  end. 

else  Cyclel(  A^,  P,  H,j,  i,  d) 


To  illustrate  Algorithm  A,  consider  the  graph  G  in  Figure  14.  Let  U4  be  the 
initial  vertex  i/.  Both  Ci  and  cq  satisfy  the  criteria  for  being  a  new  edge  to  add 
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to  the  walk.  Without  loss  of  generality  select  ei.  Subsequently,  both  62  and  €4 
satisfy  the  criteria  for  a  new  edge  to  enter  the  walk.  Let  be  selected.  Continu¬ 
ing,  the  double  Eulerian  cycle  constructed  by  Algorithm  A  is  the  sequence  of  edges 
P  =  [61,64,64,62,63,  C3,C2,  Cl,  cg, 65,  csiCfi]-  Alternatively,  if  62  had  been  selected 
rather  than  64,  the  sequence  of  edges  P'  =  [61,62,63,63,62,64,64,61,66,65,65,66]  is 
generated.  Both  F  and  P'  have  the  same  measure. 

The  algorithm  adds  a  new  edge  to  the  walk  when  the  three  conditions  stated 
above  are  satisfied.  Each  new  edge  has  an  edge-distance  from  u  of  exactly  one  greater 
than  the  last  o'-edge  visited.  During  the  walk  if  a  new  edge  cemnot  be  trav  ed,  the 
algorithm  backtracks  through  the  last  visited  (7-edge  until  the  criteria  for  adding  a 
new  edge  is  met.  When  a  (r-edge  is  added  through  backtracking,  the  (7-edge  to  be 
traversed  is  the  one  that  has  the  greatest  edge-distance  from  u  of  all  (7-edges  currently 
in  the  walk.  This  is  clearly  the  last  visited  (7-edge.  The  following  lemma  shows  that 
ultimately  each  edge  in  the  graph  is  traversed  exactly  twice. 

Lemma  IV.6  Let  G  =  (V,E)  6e  a  connected  graph.  Algorithm  A  constructs 
a  double  Eulerian  cycle  on  G  beginning  at  v  €V, 

Proof:  We  prove,  by  induction  on  the  edge-distance  from  the  initial  vertex  r, 
that  each  edge  is  traversed  exactly  twice. 

Basis  Step:  The  edge-distance  between  the  first  edge  e  on  the  walk  generated 
by  Algorithm  A  and  the  initial  vertex  v  is  0.  Since  there  are  only  a  finite 
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number  of  edges,  the  edge-distance  from  v  to  each  edge  is  finite.  Therefore, 
the  walk  must  eventually  backtrack  through  each  of  the  (T-edges  of  the  ongoing 
walk  generated  by  the  algorithm.  When  there  is  no  edge  of  distance  1  to  be 
added  after  edge  e,  taen  edge  e  will  be  revisited  by  backtracking.  Hence  e  is 
traversed  exactly  2  times.  In  a  similar  fashion  when  the  walk  continues  along 
another  edge  e'  of  edge-distance  0  from  v,  finiteness  and  backtracking  insure 
that  e'  is  traversed  exactly  twice. 

Inductive  Step:  Assume  that  the  algorithm  traverses  esich  edge  e  where 
de(v,e)  =  k.  Let  Ck+i  be  an  edge  of  edge-distance  A:  -j-  1  from  v.  There 
exists  an  edge  Ck  of  edge-distance  k  incident  with  edge  e/b+i,  else  Ck+i  is  not 
of  distance  A:  -I- 1  from  v.  Let  vertex  Vk  be  incident  to  both  ejt  and  Ck+i  ■  The 
distance  from  v  to  Vk  is  k,  else  either  the  edge-distance  to  Ck  is  not  k  or  the 
edge-distance  to  Ck+i  is  not  A;  -f- 1.  When  e*  is  traversed  for  the  first  time,  the 
edge  Ck+i  satisfies  the  criteria  to  enter  the  waJk.  Either  e^+i  or  another  edge 
®fc+i  of  distance  A:  -|-  1  will  be  the  next  edge  added  to  the  walk.  If  Ck+i 
is  not  the  next  edge  in  the  walk,  then  because  there  aure  only  a  finite  number 
of  edges  and  by  backtracking,  the  walk  will  eventuadly  return  to  u*.  The  edge 
e*+i  still  satisfies  the  criteria  to  enter  the  walk.  Since  there  are  only  a  finite 
number  of  edges  incident  to  Vk,  ek+i  must  ultimately  enter  the  walk  and  be 
traversed  exactly  twice  through  backtracking.  ■ 
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We  made  the  point  that  when  an  edge  of  distance  k  from  u  is  added  to  a  walk 
in  Algorithm  A,  it  is  added  when  an  edge  of  distance  A:  —  1  is  the  last  a-edge  in  the 
walk.  Now  we  make  a  further  statement  about  the  set  of  tr-edges  when  an  edge  of 


distance  k  is  added  to  a  walk. 

Lemma  IV. 7  When  an  edge  Cfc  of  edge-distance  k  from  the  initial  vertex  of 
the  walk  is  traversed  for  the  first  time  using  Algorithm  A,  there  are  exactly  k 
edges  that  have  been  previously  traversed  exactly  once. 

Proof:  We  prove,  by  induction  on  the  edge-distance,  that  when  an  edge  e*  of 
edge-distance  k  from  the  initial  vertex  t;  of  the  walk  is  traversed  for  the  first 
time  using  Algorithm  A,  there  are  exactly  k  a-edges  on  the  walk. 

Basis  Step:  When  an  edge  e  of  edge-distemce  0  is  traversed  for  the  first  time, 
we  are  at  the  vertex  v  and  there  are  no  <r-edges  remaining  in  the  walk. 

Inductive  Step:  Assume  that  when  any  edge  of  edge-distance  k  is  traversed 
using  Algorithm  A  for  the  first  time  there  are  exactly  k  a-edges  in  the  ongoing 
walk.  Let  de{v,  e*+i)  =  k+1.  The  edge  e*+i  is  initially  traversed  only  when  the 
last  visited  a-edge  e  in  the  waJk  has  an  edge-distance  of  k  from  v.  Therefore, 
by  the  inductive  hypothesis,  there  are  then  exactly  fc  -|-  1  <T-edges  after  Ck+i 
initially  enters  the  walk.  ■ 
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Now  we  can  relate  the  measure  of  double  Eulerian  cycles  in  a  graph  G  to  the 
edge-status  of  G. 

Theorem  IV.8  Let  G  =  {V,E)  be  a  connected  graph  of  size  q.  Let  u  €  V, 
with  edge-status  es^.  If  m  is  the  minimum  measure  of  a  double  Eulerian  cycle 
Wv  beginning  at  v,  then  m  =  2(cs„)  -H  q. 

Proof:  Algorithm  A  constructs  a  double  Eulerian  cycle  through  a  graph. 
When  an  edge  ej  of  edge-distance  j  from  the  initial  vertex  in  the  walk  is 
traversed  for  the  first  time  using  Algorithm  A,  Lemma  IV. 7  states  there  are 
exactly  j  <T-edges.  This  then  adds  j  to  the  measure  of  the  walk  (1  for  each  of 
the  j  <r-edges).  When  an  edge  e,  of  edge-distance  j  is  traversed  by  Algorithm 
A  for  the  second  time,  there  are  exactly  j  -|- 1  tr-edges  (the  edge  Cj  now  belongs 
to  the  set  of  o-edges).  This  adds  j  -1- 1  to  the  measure.  Therefore,  the  mezisure 
of  a  double  Eulerian  cycle  constructed  by  Algorithm  A  is 

S  c)  +  1)  =  51  (2‘^e(v,  e)  +  1)  =  2(eSv)  +  q. 

e€£  e€£ 

It  follows  immediately  from  Lemma  IV.5  that  the  minimum  measure  of  a 
double  Eulerian  cycle  Wj,  beginning  at  vertex  v  is  2(eSv)  +  9.  ■ 

Corollary  rV.9  Let  G  be  a  connected  graph.  Let  W2,  be  a  double  Eulerian 
cycle  on  G  that  produces  the  value  of  G.  Then  v  e  EM{G). 
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G: 

Figure  15.  To  illustrate  the  value  of  a  graph 
Proof;  Follows  directly  from  Theorem  IV.8  zmd  the  definition  of  EM{G).  ■ 

Corollary  IV.IO  Let  G  =  {V,E)  be  a  connected  graph  of  size  q  with  a  value 
of  m.  Let  V  €  EM{G),  with  edge-status  es^.  Then  m  =  2(es„)  +  q. 

Proof;  Follows  immediately  from  Theorem  IV.8.  ■ 

Corollary  FV.ll  Let  T  =  (V,  £)  be  a  tree  of  size  q.  Let  v  €  V,  with  edge- 
status  cSv.  The  measure  m  of  a  double  Eulerian  cycle  Wy  inT  ism  =  2{sy)—q. 

Proof;  Follows  directly  from  Theorem  i  -  ^  and  Theorem  IV.8.  ■ 

We  have  shown  that  the  value  of  a  connected  graph  G  is  determined  by  the 
edge-status,  esy  of  a  vertex  u  €  EM{G)  and  by  q  the  number  of  edges  in  G.  Algo¬ 
rithms  to  find  the  median  of  a  graph  can  be  found  in  [Ref.  24]  and  [Ref.  25).  These 
algorithms  can  be  modified  in  a  straightforward  manner  to  determine  the  distances 
from  vertices  to  edges  and  the  edge-median  of  a  graph. 
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We  discussed  earlier  that  the  measure  of  a  walk  IV  €  W3  can  be  defined  using 
the  distance  between  the  two  occurrences  of  each  edge  on  the  cycle  W.  If  each  cyclic 
shift  of  a  cycle  is  considered  equivalent,  this  definition  assigns  the  same  measure  to 
equivalent  cycles.  In  Figure  15,  the  double  Eulerian  cycle, 

W  =  (cj ,  C5,  £4,  C3,  C3,  Cj,  63,  64,  Cfi,  Cfi,  C5,  Cl ), 

and  each  cyclic  shift  of  W  has  the  measure  12  when  using  the  distance  between 
identical  edges  to  calculate  the  measure. 

C.  THE  VALUE  OF  A  DIRECTED  GRAPH 

The  following  well-known  theorem  is  stated  without  proof.  See,  e.g.,  [Ref.  19]. 

Theorem  rV.12  A  weak  digraph  is  Eulerian  if  and  only  if  for  every  vertex 
its  in-degree  and  out-degree  are  equal. 

From  this  theorem,  the  next  result  follows  readily. 

Lemma  IV.IS  Every  double  Eulerian  walk  in  a  digraph  D  is  closed. 

Proof:  Let  P  =  {u  =  =  v)  be  a  double  Eulerian  cycle  in  the 

digraph  D  =  {V,  A)  where  |A1  —  k.  If  u  u,  then  v  appears  an  even  number 
of  times  on  P,  since  each  incoming  arc  on  v  is  used  exaw:tly  twice.  Since  v  is 
the  terminal  vertex  of  P,  however,  v  has  been  exited  an  odd  number  of  times. 
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This  is  a  contradiction,  since  P  is  double  Eulerian.  Thus  u  =  u,  and  P  is  a 
closed  walk.  As  an  immediate  corollary,  in(u)  =  out(u),  and  D  is  Eulerian.  ■ 

The  following  theorem  shows  that  the  Eulerian  and  double  Eulerian  property  of  a 
digraph  are  equivalent. 

Theorem  rV.14  A  weak  digraph  D  is  double  Eulerian  if  and  only  if  D  is 
Eulerian. 

Proof:  Suppose  that  a  digraph  D  is  double  Eulerian.  Since  a  double  Eulerian 
cycle  provides  a  path  between  any  pair  of  vertices,  the  digraph  must  be  strong 
(and  hence  also  weak).  By  Lemma  IV.13,  the  in-degree  and  out-degree  of  each 
vertex  is  even.  Therefore,  D  is  Eulerian.  Conversely,  suppose  D  is  Eulerian. 
Traversing  an  Eulerian  trail  exactly  twice  yields  a  double  Eulerian  cycle.  ■ 

The  Eulerian  graph  can  be  partitioned  into  edge  disjoint  cycles. 

Theorem  rV.15  A  digraph  D  has  an  edge-factor  if  and  only  if  it  is  Eulerian. 

Proof:  Let  be  an  Eulerian  digraph.  A  closed  Eulerian  trail  on  Z?  is  an  edge- 
factor.  Conversely,  let  F  be  an  edge-factor  of  the  digraph  D.  The  in-degree 
and  out-degree  of  any  vertex  in  each  cycle  of  F  are  equal.  Since  the  arcs  on 
the  cycles  in  F  partition  the  set  of  arcs  in  D,  the  in-degree  and  out-degree  of 
each  vertex  in  D  must  be  equal.  Therefore,  D  is  Eulerian.  ■ 
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By  Theorem  IV.  15  every  double  Eulerian  cycle  Hj,  along  a  digraph  D  defines 
a  specific  edge-factor  F.  Given  F  we  can  obtain  a  lower  bound  for  the  measure  of 

Theorem  IV. 16  Ltt  F  =  {Cx^C-i^. . .  ,Cj}  be  an  edge-factor  of  the  digraph 
D.  The  measure,  m,  of  any  double  Eulerian  cycle  Wj,  in  D  satisfies 

Proof:  We  prove  this  by  using  strong  induction  on  k,  where  k  is  the  largest 
number  of  cycles  in  any  edge-factor  of  D. 

Basis  Step:  Let  Z)  be  a  digraph  such  that  for  any  edge-factor  F  of  D,  [Fj  =  1 
(i.e.,  no  edge-factor  of  D  has  more  than  a  single  cycle).  Therefore  the  set  E 
of  cycles  in  each  edge-factor  of  D  is  the  set  of  Eulerian  cycles  in  D.  Then 
F  =  {R}  where  R  £  E.  Every  double  Eulerian  cycle  in  VV2,  along  D  is  exactly 
two  traversals  of  such  a  cycle  R.  Therefore,  the  mecisure  of  any  walk  in  W2,  is 

|fl|“  =  E  + ^v(F). 

C^F 

(Note;  Pv{F)  —  0  since  all  vertices  v  sje  on  R  emd  C  =  F  is  the  only  cycle  in 

F-) 

Inductive  Step:  Let  F  =  {Ci,  (72, . . . ,  C/}  be  an  edge-factor  of  the  digraph  D. 
Assume  that  the  value  m  of  2uiy  double  Eulerian  cycle  W2,  in  D  satisfies 

|C.f  -I-  Pv(F). 

t=i 
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Let  D  be  a  digraph  satisfying  |F|  =  n,  and  let  W2,  be  a  double  Eulerian 
cycle  on  D.  Since  D  is  finite,  then  at  some  point  along  W2,  a  vertex  is  repeated. 
This  defines  a  cycle  in  D.  Let  C  be  the  first  cycle  completed  along  W2, .  Let  a 
be  the  arc  traversed  on  C.  After  a  is  initially  traversed  there  are  exactly 
k  <T-arcs  in  C.  Let  a'  be  the  arc  on  C  to  be  traversed  for  the  second  time 
along  W2,.  After  a'  is  traversed  for  the  second  time,  there  axe  \C\  —  j  <r-arcs 
in  C.  Furthermore,  each  step  along  C  increases  the  difference  of  the  visitation 
time  by  one  for  each  <T-arc  not  in  C.  Since  each  arc  in  C  is  traversed  twice,  C 
increases  by  2  the  difference  of  the  visitation  times  for  exactly  one  arc  of  each 
edge-distance  i  from  v  (by  the  definition  of  C),  where  0  <  t  <  de{v,C)  —  1. 
Therefore,  the  set  of  arcs  in  C  taken  together  increases  the  measure  of  the 
walk  W2,  by  some 

\c\  |C| 

£c  >  E  *  +  E(IC|  -  i)  +  2|CK(t>,  C)  =  |C|’  +  C). 

k=l  i-l 

Let  ly  =  D  —  (C)v.  After  we  remove  C,  the  edge-factor  for  each  component 
of  ly  has  at  most  n  —  1  cycles.  Let  F'  =  {C[,  Cj, . . . ,  Cj}  be  an  edge-factor 
of  the  digraph  D*.  By  the  inductive  hypothesis  the  measure  m  of  any  double 
Eulerian  cycle  on  any  component  G  of  O'  satisfies 

i=l 
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Therefore,  if  F  =  {Ci^C], . . .  ,0/}  is  an  edge-factor  of  the  digraph  D,  then 
the  measure  m  for  any  double  Eulerian  cycle  along  D  satisfies 
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We  now  describe  an  algorithm  (Algorithm  D)  that  constructs  a  double  Eulerian 
cycle  of  minimum  measure,  beginning  at  a  vertex  von  an  Eulerian  digraph.  Algorithm 
D  adds  cycles  to  create  a  double  Eulerian  cycle  whose  measure  attains  the  lower  bound 
of  Theorem  IV. 16.  This  algorithm  is  similar  to  Algorithm  A.  The  reader  should  make 
note  of  the  similarities  and  differences  between  the  roles  played  by  the  edges  of  a  graph 
in  Algorithm  A  and  the  cycles  in  the  Eulerian  digraph  in  Algorithm  D.  Informally 
speaking.  Algorithm  D  moves  forward,  entering  cycles  of  greater  distance  from  a 
vertex  v  for  as  long  as  this  is  possible.  When  it  is  no  longer  possible  to  enter  a  new 
cycle  at  greater  distance,  the  sJgorithm  traverses  each  au:c  on  the  current  cycle  exactly 
twice  and  then  backtracks  to  the  previous  cycle.  The  backtracking  continues  to  the 
first  cycle  from  which  it  is  possible  to  go  forward  to  enter  new  cycles.  It  is  clear  that  a 
cycle  C  may  be  entered  at  several  places  along  a  given  walk.  The  first  edge  traversed 
on  C  cannot  be  at  distance  d  >  de{v,C).  Algorithm  D  does  not  allow  a  cycle  to  be 
entered  for  the  first  time  at  any  vertex  other  than  a  vertex  of  distance  de{v,  C). 

Algorithm  D  includes  a  subroutine  Cycle2  that  is  recursively  called  through¬ 
out  the  algorithm.  Algorithm  D  constructs  a  double  Eulerian  cycle  Wj,  by  succes- 


sively  selecting  cycles  from  an  edge-factor  to  be  traversed.  Each  new  cycle  C  added 
has  the  following  properties: 

1.  There  are  de{v,C)  arcs  in  that  have  been  traversed  exactly  once  when  C 
is  entered. 

2.  No  arc  on  the  cycle  C  has  previously  been  traversed. 

3.  If  n  is  the  last  vertex  encountered  along  the  walk,  then  n  €  (C)v  and 
d{v,n)  =  de(v,C). 

ALGORITHM  D  [Constructs  a  double  Eulerian  cycle  on  an  Eulerian  digraph  be¬ 
ginning  at  vertex  u] 

Input:  An  Eulerian  digraph  D  =  (V^A)  of  size  q,  a  vertex  u  £  V,  and  edge-factor 
C  =  {ci,  C2, . . . ,  c/}  as  global  parameters. 

Output:  Array  P,  an  ordered  list  of  the  arcs  constituting  a  double  Eulerian  cycle 
through  D  beginning  at  the  given  vertex  u. 

Parameters:  N  —  [ni,. . . ,n2,+i],  P  =  [pi,. •  •  ,P2j],  and  M  =  [mi,...,m/],  are 
global  arrays  of  vertices,  arcs,  and  integers  of  size  2g  -|-  1,  2q,  and  /,  respectively. 
Parameters  t,  j,  and  t  are  integers. 

The  following  is  a  brief  description  of  specific  parameters: 

1.  The  parameter  nj  indicates  the  current  vertex  in  the  walk. 

2.  The  value  of  m;  indicates  the  following  about  the  arcs  in  the  cycle  c/: 

(a)  If  m/  =  0  then  no  arcs  on  C/  are  currently  on  the  walk, 

(b)  If  m;  =  1  then  at  least  one,  but  not  every  arc  in  c/  is  currently  on  the  walk, 

(c)  If  m/  =  2  every  arc  in  c/  has  been  traversed  at  least  once. 

3.  The  parameter  t  indicates  the  number  of  <T-arcs  currently  in  the  walk. 
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Algorithm  D 

Initially  set  j  :=  0;  i  :=  1;  t  :=  0;  ni  :=  i/;  n*  :=  empty,  2  <  k  <  2q  -h  1; 
p,  :=  empty,  1  <  s  <  2q;  and  :=  0,  1  <  ^  <  /. 

Cycle2(iV,  P,  M,  C,  j, :,  t);  end. 

Subroutine  Cycle2(-/V,  P,  A/,  C,  j,  r,  t) 

(Dl)  if  [[<1(1^, n.)  =  de(i/,c/)  =  t]  and  [m/  =  0]]  where  rn  €ciJ  e  {1,2, ...,/} 
then  j  :=  j  +l 
T emp q 

Cl  :=  Cj 

Cj  ;=  Temp 
T emp  :=  m/  :=  mi  +  I 

mi  :=  mj 

mj  :=  Temp 

n,+i  :=  X  where  (ni,x)  €  cj 
Pi  •—  (n,',Tlj+i) 
t  :=  i  +  1 
t  ;=  t  +  1 

Cycle2(iV,P,A/,C,j,i,t) 

(D2)  else  if  [[d(i/,n,)  =  de(i/,Cj)  =  (t  —  |cj|)]  and  [mj  =  1]]  where  €  Cj 
then  :=  x  where  (n,-,x)  €  cj 

Pi  ;=  (ni,ni+i) 

i  ;=  i  +  1 

mj  :=  mj  +  1 

Cycle2(iV,  P,  Af,  C,j,  i,  t) 

else  if  [[d{i/,  n.)  =  de{i',Cj)  =  t]  and  [mj  =  2]]  where  n,  €  Cj 
then  j  :=  j  —  I 

Cycle2(iV,  P,  M,  C,  j,  i,  t) 
else  if  mj  =  1 

then  nj^.1  :=  x  where  (nj,x)  €  Cj 
Pi  :=  (nj,n.+i) 
t  :=  i  +  1 
t:=t  +  l 

Cyc\e2{N,  P,  M,  C,j,i,t) 
else  if  mj  =  2 

then  nj+i  :=  x  where  (n,,  x)  €  Cj 
p,  :=  (ui,  x) 

*  :=  t  +  1 
<:=<-! 

Cycle2(iV,  P,  M,  C,  j,  i,t) 
else  end 


(D3) 

(D4) 

(D5) 

(D6) 
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Figure  16.  A  digraph  to  illustrate  Algorithm  D 


The  following  is  a  brief  description  of  the  conditional  lines  in  Algorithm  D: 


1.  Line  (Dl)  insures  that: 

(a)  There  are  de{i/,ci)  <7-arcs  in  the  walk  when  c/  is  initially  entered  since 
de(i/,c/)  =  t, 

(b)  No  arc  on  the  cycle  c/  has  previously  been  traversed  (m/  =  0), 

(c)  The  distance  to  the  last  vertex  in  the  walk  equals  the  edge-distance  to  C[ 
from  the  initial  vertex  i/,  i.e.,  d{i/,ni)  =  de{i/,ci). 

(The  current  cycle  goes  from  Cj  to  Cj+i.) 

2.  Line  (D2)  insures  that  at  the  completion  of  one  traversal  of  the  cycle  Cj  another 
cycle  is  not  entered  for  the  first  time,  rather,  the  walk  continues  to  traverse  Cj 
for  a  second  time, 

3.  Line  (D3)  insures  that  when  every  arc  in  a  cycle  Cj  is  traversed  exactly  twice, 
the  walk  continues  along  the  cycle  Cj_i. 

4.  Lines  (D4)  and  (D5)  guide  the  walk  along  the  current  cycle  Cj  unless  it  is  time 
to  initially  enter  the  cycle  cj+i  (Dl),  to  reenter  the  cycle  Cj_i  (D3),  or  to  begin 
the  second  traversal  around  the  cycle  Cj  (D2).  Line  (D4)  is  satisfied  during  the 
first  traversal  of  Cj  while  (D5)  is  satisfied  during  the  second  traversal  of  cj. 

5.  Line  (D6)  ends  the  procedure. 
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To  .ilustrate  Algorithm  D,  consider  the  digraph  D  in  Figure  16.  Let  C  be 
the  edge-factor  consisting  of  the  cycles  X  —  (07,09,09),  Y  =  (01,05,06,010),  and 
Z  =  (02,03,04).  Initially  set  ni  =  vi\  p,  :=  empty,  1  <  s  <  20;  Af  =  [0,0,0]; 
C  =  [X,r,Z];j:=0;i:=l;t:=0; 

The  conditional  lines  in  Algorithm  D  axe  satisfied  in  the  following  order: 


•  Cycle2(X,  P,  M,  C,0, 1,0)  is  called  and  the  conditions  for  line  D1  are  satis¬ 
fied  with  c/  =  C2  =  y.  The  values  for  the  parameters  are  currently  j  =  1, 
C  =  [y, X, Z],  M  =  [1,0,0, ...,0],  na  =  U2i  Pi  =  <*1,  *  =  2,  and  T  =  1. 

•  Cycle2(X,  P,  A/,  C,  1,2, 1)  is  called  and  the  conditions  for  line  D1  are  satis¬ 
fied  with  c/  =  C3  =  Z.  The  values  for  the  parameters  are  currently  j  —  2, 
C  —  [X,  Z,  y],  M  =  [1,1,0],  n3  =  U3,  p2  =  02,  i  =  2,  and  T  =  2. 

•  Cycle2(X,  P,  Af,  C,  2, 2, 2)  is  called  and  the  conditions  for  line  D4  are  satisfied. 
The  values  for  the  parameters  are  currently  j  =  2,  C  =  [X,  Z,  y],  Af  =  [1, 1. 0], 
^4  =  V4,  P3  =  <*3,  i  —  3,  and  T  =  3. 

•  Cycle2(X,  P,  Af,  C,  2, 3, 3)  is  called  and  the  conditions  for  line  D4  are  satisfied. 
The  values  for  the  parameters  are  currently  j  =  2,  C  =  [X,  Z,  y],  Af  =  [1,1,0], 
ns  =  t;2)  Pa  =  <*41  i  =  4,  and  T  —  4. 

•  Cycle2(X,  P,  Af,C,  2,4,4)  is  called  and  the  conditions  for  line  D2  are 

satisfied  with  Cj  =  =  Y.  The  values  for  the  parameters  are  currently 

j  =  2,  C  =  [X,  Z,  y],  Af  =  [1, 2, 0],  ne  =  U3,  ps  =  02,  i  =  4,  and  T  =  3. 

•  Cycle2(X,P,Af,C,  2,4,3)  is  called  and  the  conditions  for  D5  are  satisfied. 
The  values  for  the  parameters  2u-e  currently  j  =  2,C  =  [X,  Z,  y],  Af  =  [1, 2, 0], 
«7  =  V4,  pe  =  03,  *  =  4,  and  T  =  2. 

•  The  remaiining  conditional  lines  are  satisfied  as  follows:  D5,  D3,  D4,  D4,  D4, 
D2,  D5,  D5,  Dl,  D4,  D4,  D2,  D5,  D5,  D3,  D4,  and  D6  ends  the  program. 


We  find  the  sequence  of  arcs  traversed  to  be 


P  =  [fll,  O2,  O3,  04,  02,  O3,  O4,  O5,  06,  Oio,  Oi,  O5,  06,  09,  Os,  O7,  O9,  Os,  0?,  Oiq]. 
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We  now  show  that  Algorithm  D  constructs  a  double  Eulerian  cycle  W  whose 
measure  achieves  the  lower  bound  given  in  Theorem  IV.  16.  First  we  need  to  show  a 


relationship  between  adjacent  cycles  in  an  edge-factor  of  a  digraph. 

Lemma  IV.17  For  an  edge-factor  F  of  D  =  (V,  A),  letC  €.  F  and  v  a  vertex 
where  de{v,C)  >  0.  There  exists  a  cycle  P  ^  F  adjacent  to  C  such  that 
dt{v,P)  <  dt{v,C). 

Proof:  Let  F  be  an  edge-factor  of  D  where  C^P  €  F.  Let  d{v,x)  =  de{v,C) 
where  x  €  (0)^  and  {oi, . . is  the  set  of  arcs  incident  to  x.  Without  loss 
of  generzdity,  let  Oj  and  02  be  arcs  in  C.  Then  the  minimal  length  walk  W 
between  v  and  x  does  not  include  either  of  aj  and  02  else  there  is  a  vertex 
z  €  (C)v  such  that  d{v,z)  <  d{v,x).  Without  loss  of  gererality,  let  a*  be  the 
arc  incident  to  i  in  W.  Therefore,  C  is  adjacent  to  a  cycle  P  where  x  €  (P)v 
and  de(u,  P)  <  de(v,  C).  ■ 

We  now  show  that  the  walk  constructed  by  Algorithm  D  visits  every  edge 
exactly  twice. 

Lemma  IV.  18  For  a  given  edge-factor  F  =  {Ci,C2, . . .  ,C/}  of  the  digraph 
D  =  (V,  A)  and  a  vertex  v  €  V,  Algorithm  D  constructs  a  double  Eulerian 
cycle  along  D  beginning  at  vertex  v. 


68 


Proof:  We  prove,  by  strong  induction  on  the  edge-distance  to  a  cycle  C  €  F 
from  the  initial  vertex  u,  that  each  arc  in  D  is  traversed  exactly  twice. 

Basis  Step:  Let  F  be  an  edge-factor  of  a  digraph  D.  The  initial  step  of  the 
walk  beginning  at  vertex  v  is  along  a  cycle  Co  where  de(v,  Co)  =  0.  The  walk 
continues  along  the  arcs  of  Co  until  each  of  the  3  criteria  (on  page  64)  are 
satisfied  for  a  new  cycle  to  enter  the  walk.  The  edge-distance  from  u  to  a  new 
cycle  added  to  is  greater  than  the  distance  of  the  current  cycle.  As  there 
are  only  a  finite  number  of  arcs  in  each  cycle  and  only  a  finite  number  of  cycles, 
eventually  during  the  traversal  of  any  cycle  there  will  be  no  adjacent  cycles  of 
greater  edge-distance  that  have  not  been  initially  entered.  When  the  current 
cycle  is  exited  it  is  to  a  cycle  of  smaller  edge-distance  from  v.  This  can  only 
occur  after  each  of  the  arcs  of  the  current  cycle  have  been  traversed  exactly 
twice.  Since  there  are  only  a  finite  number  of  cycles  and  Co  is  the  cycle  of 
smallest  distance  on  the  walk,  Co  must  eventually  be  revisited  and  traversed 
exactly  twice.  A  similar  argument  insures  that  every  cycle  of  distance  0  is 
traversed  exactly  twice. 

Inductive  Step:  We  now  assume  that  the  algorithm  traverses  each  cycle  Cn  of 
distance  n  from  v  exactly  twice  for  al\n  <  k.  Let  Ck  be  a  cycle  of  edge-distance 
k  from  V.  By  Lemma  IV.  17  there  exists  an  arc  of  edge-distance  fc  —  1  on 
a  cycle  Cj  adjacent  to  Ck  where  dt{v,Ci)  =  i,  0  <  i  <  k.  Therefore,  on  the 
first  or  second  traversal  of  Cj,  the  conditions  for  the  new  cycle  Ck  to  enter  the 
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walk  are  satisfied  and  C*  is  entered.  From  the  discussion  above,  Ck  will  then 


be  traversed  exactly  twice.  ■ 

We  have  seen  that  Algorithm  D  completes  a  double  Eulerian  cycle.  We  now 
discuss  the  number  of  a-arcs  existant  at  each  step  of  the  walk  constructed  by  Al¬ 
gorithm  D.  The  following  lemma  shows  that  we  can  obtain  the  lower  bound  of 
Theorem  IV.  16. 


Lemma  FV.IQ  Let  F  be  an  edge-factor  of  the  digraph  D  =  {V,A).  The  min¬ 
imum  measure  of  a  cycle  kFj,  generated  by  Algorithm  D  is  ^  \C\^  4-  Pv{F), 


where  Pv{F)  is  the  posture  for  the  factor  F. 


C&F 


Proof:  By  Lemma  IV.  18,  Algorithm  D  produces  a  walk  that  traverses 

each  arc  of  an  Eulerian  digraph  exactly  2  times.  Furthermore,  when  the  first 

ajc,  Qi,  in  a  cycle  Cj  of  distance  j  from  v  is  traversed  in  Algorithm  D  for 

the  first  time,  there  are  exactly  j  <7-arcs  along  W2,.  This  follows  from  line 

(Dl)  of  Algorithm  D,  the  conditional  statement  governing  when  new  cycles 

can  initially  enter  W2,.  When  the  fc**  arc  q*  in  cycle  Cj  is  traversed  for  the 

first  time,  there  are  exactly  —  l  <7-arcs  in  W^.  Therefore,  the  first  traversal 

of  each  arc  in  Cj  adds  -f-  A:  —  1)  to  the  measure  of  the  walk.  When  the 

fc=i 

arc  of  the  cycle  Cj  is  traversed  for  the  second  time,  there  are  j  -I-  1C|  -I- 1  —  Ar 


<T-arcs  in  IVj,.  Therefore,  the  second  traversal  of  each  of  the  arcs  in  Cj  adds 


IC,I 

£(>  +  |C|  +  1  —  Jb)  to  the  measure  of  fVj,-  It  follows  then  that  including  the 

ksl 

cycle  Cj  in  the  walk  VVj,  contributes 

/\c\  \  /\c\ 

(530  +  ^  ~  I)  j  +  ( +  ICl  +  1  -  A;) 

(J  -b  |C|  -  l)(j  +  \C\)  U  -  l)(i)  (i  -H  \C\){j  +  \C\  +  1)  (j)(j  +  1) 

2  2  2  2 

=  jCp  +  2j|Cl  to  the  measure  of  W2,.  Summing  over  all  cycles  in  F  yields 

5:(ici=+2ii(7i)  =  i;(ici=+2rf.(«,c)ici)=  Y.  ici"+/>v(n 

C€F  C€F  C€F 

Hence,  the  minimum  measure  of  W2,  ismin  5]  |C1*  +  ^„(F).  ■ 

^  C€F 


Theorem  rV.20  Let  F  be  an  edge-factor  of  the  digraph  D  =  (V,A).  The 

value  of  D  is  min  ^  |C1*  +  Pv{F),  where  pv{F)  is  the  posture  for  the  factor 
^  C^F 


F. 


Proof;  Follows  directly  from  Lemma  IV.  19  and  Lemma  IV.  16.  ■ 


Corollary  rV.21  Let  D  be  an  Eulerian  digraph.  Let  be  a  double  Eulerian 
cycle  on  D  that  produces  the  value  of  the  graph.  Then  v  €  Mf{D). 

Proof:  Follows  immediately  from  Theorem  IV.20  and  the  definition  of  Mf{D), 
the  mean  of  the  factor  F.  ■ 
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Distance  from  v 


Size  of  the  Cycle 


1 

2 

3 

4 

5 

6 

7 

0 

1 

2 

3 

4 

5 

6 

7 

1 

3 

4 

5 

6 

7 

8 

9 

2 

5 

6 

7 

8 

9 

10 

11 

3 

7 

8 

9 

10 

11 

12 

13 

4 

9 

10 

11 

12 

13 

14 

15 

Table  V.  Average  cost  of  arcs  in  particular  cycles 

We  have  shown  that  the  value,  i/,  of  an  Eulerizin  digraph  D  is  given  by 

u  =  minY:\C\^  +  Pv{F),  (IV.l) 

^  C€F 

where  F  is  an  edge-factor  of  D. 

Table  V  lists  the  average  amount  eax:h  arc  a  adds  to  the  measure  of  the  walk 
when  a  is  in  a  cycle  of  size  |C|  for  1  <  |C|  <  7  where  0  <  de{v,  C)  <  4  from  the  initial 
vertex  v.  It  follows  from  the  discussion  above  that  the  value  in  the  {i,j)  position  in 
Table  V  is  2i  +  j. 

We  can  modify  algorithms  that  find  the  median  of  a  graph  in  a  straightforward 
manner  to  find  the  posture  of  a  directed  graph  for  a  given  edge-factor.  Finding  all  the 
edge-factors  of  a  directed  graph,  however,  can  be  extremely  difficult.  It  is  not  obvious 
how  to  determine  a  priori  an  edge-factor  that  produces  the  double  Eulerian  cycle  of 
minimum  measure,  hence  the  value,  of  a  digraph.  The  following  two  lemmas,  however, 
remove  certain  edge-factors  of  a  digraph  D  from  consideration  when  generating  a  cycle 
with  minimum  measure. 
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Lemma  rV.22  Let  F  be  a  reducible  edge-factor  of  D  =  (V,A)  where  X  €  F 
can  be  reduced  to  cycles  A  and  B.  Then 

C€F>i,b  C€-F 

Proof:  If  X  €  F  can  be  reduced  to  the  two  cycles  A  and  B  then  either 
de{v,  A)  or  de{v,  B)  (or  both)  are  equal  to  dt{v,  X).  Without  loss  of  generality 
let  de(v,A)  =  de(v,X).  But  dt(v,B)  <  de(v,X)  +  [^J,  since  B  can  be  at 
most  half  way  around  A.  It  follows  that 

|Xl'  +  2(d.(u,A'))|X|  =  |A+B|’  +  2(d.(u,X))|A+B| 

=  1A|=  +  1B1»  +  2|A||B|  +  2(<I.(u,Jif))|A|  +  2(d.(v,;f  ))|B| 
=  |A|“  +  |B|'  +  2(d,(u,  A))|A|  +  2|fl|(|A|  +  d,(v,X)) 

>  |Ap  +  |B|*  +  2(<(.(u,A))|A|  +  2|B|^  1|1  +d.{v,X)\ 

>  I A|'  +  2(<i.(u, A))|A1  +  |B|’  +  2{d,(v,  B))1B1. 

Therefore, 

5;  \c\’‘ + d,(u,  c)  <  icp + <f,(u,  c). 

C^Fa,b  C^F 

■ 

Corollary  rV.23  Let  F  be  a  reducible  edge-factor  of  D  =  {V,A).  Then  a 
double  Eulerian  cycle  that  defines  F  does  not  produce  the  minimum  measure 
for  D. 
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Proof:  Follows  directly  from  Lemma  IV .22  and  Theorem  IV.  19. 


Equation  IV.  1  suggests  that  minimizing  the  sum  of  (|C|^)'s  in  the  edge-factor 
should  reduce  the  measure  of  a  double  Eulerian  cycle.  One  would  expect  the  cycles 
in  the  edge-factor  that  yield  a  double  Eulerian  walk  of  minimum  measure  to  manifest 
the  same  properties  as  cycles  that  minimize  the  sum  of  the  squares  of  their  lengths. 

Lemma  IV. 24  Let  A  =  {Ai  >  A2  >  . . .  >  A«  >  0}  6c  a  partition  of  the 

t 

positive  integer  K.  Then  A  is  the  partition  that  minimizes  ^  Ai^  if  and  only  if 

t=i 

Ai  —  At  <  1 

Proof:  Let  A  be  the  partition  that  minimizes  A,^.  Assume  Ai  —  At  >  2. 

Then 

(Aj  —  1)^  -f  (At  +  1)^  =  Ai^  -f  1  —  2Ai  -J-  At^  -b  1  2At 

=  Ai^ At* -h  2 -I- 2(Af  -  Ai). 

Since  At  —  Ai  <  —2  it  follows  directly  that  (Ai)*  -I-  (At)*  >  (Ai  —  1)*  -I-  (At  -I- 1)*. 

Therefore,  Ai  and  At  are  not  in  the  partition  A  that  minimizes  the  sum  of  the 

squau’es.  ■ 

Loosely  speaking,  a  factor  whose  cycles  are  as  nearly  as  possible  equal  sized 
should  produce  the  smallest  measures.  In  the  next  section  we  apply  this  heuristic  to 
the  Good  -  de  Bruijn  digraph  to  obtaun  the  values  of  these  digraphs. 
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D.  THE  VALUE  OF  THE  DE  BRUIJN  DIGRAPH 

Lemma  IV.24  shows  that  an  edge-factor  that  minimizes  the  sum  of  the  squares 
of  the  cycle  lengths  in  a  digraph  minimizes  the  differences  between  cycle  length.  Every 
edge-factor  of  includes  at  least  one  cycle  of  length  at  least  n  -f  1 .  For  example, 
the  (n  -1-  l)-tuple  000 ...  1  representing  an  arc  in  Bn  must  belong  to  a  cycle  of  length 
greater  than  or  equal  to  n  -I- 1.  Additionally,  if  arc  a,  represented  by  the  (n  -|-  l)-tuple 
of  all  zeros,  000 ...  0,  is  not  a  member  of  the  cycle  C  =  (0)  of  length  1,  then  it  is 
an  arc  in  a  reducible  cycle.  Corollary  IV.23  showed  that  reducible  cycles  are  never 
permitted  in  the  factor  that  generates  a  walk  of  minimum  measure.  Therefore,  the 
cycle  (0)  will  appear  in  the  factor  yielding  the  value  of  the  Good  -  de  Bruijn  digraph 
Bn-  It  follows  that  the  minimum  difference  between  the  lengths  of  cycles  is  at  least  n 
for  any  edge-factor  of  B„  that  produces  the  value.  Therefore,  if  F  is  an  edge-factor  of 
irreducible  cycles  in  B„  there  have  to  exist  cycles  in  F  whose  lengths  differ  by  at  least 
n.  We  would  want  the  lengths  of  the  cycles  in  F  to  be  equal,  but  we  now  see  that  F 
must  have  a  cycle  of  length  1  and  a  cycle  of  at  least  n  -I- 1.  The  following  theorem  by 
Golomb  [Ref.  6]  insures  that  the  cycle  sizes  in  the  edge-factor  PC  Rn+i  differ  by  at 
most  n. 

Theorem  rV.25  (Golomb)  Let  the  edge-factor  of  Bn  be  generated  by  the 
PCRn+i-  Then  the  only  cycles  appearing  are  those  of  length  d,  where  d|(n-t- 1). 

In  [Ref.  6]  it  is  shown  that  there  are  two  possible  cycles  of  length  1,  (0)  and 
(1),  only  one  cycle  of  length  2,  (10),  two  cycles  of  length  3,  (011)  and  (001),  and  in 
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d\n 

summation  is  extended  over  aii  divisors  of  n. 


The  value  of  is  a  function  of  the  edge-factor  of  and  also  of  the  distances 
to  the  respective  cycles  of  the  factor  in  the  underlying  graph  of  Bn-  Further,  the 
distance  between  cycles  in  an  edge-factor  of  B„  and  the  initial  vertex  v  of  the  minimum 
double  Eulerian  cycle  affect  the  value  of  B„.  A  closer  examination  of  Equation  IV.  1 
suggests  that  if  the  distance  to  cycles  of  larger  length  in  F  from  the  initial  vertex  v 
tend  to  be  smaller  than  the  distances  to  cycles  of  smaller  length,  the  edge-factor  F 
should  generate  a  smaller  measure. 

The  directed  distance,  dx)(x,y),  between  x  =  (xo,xi, . . .  and 

y  =  (j/OjI/i, •  •  •  ,yn-i)  in  Bn  is  n  —  7  where  7  is  the  longest  string  of  consecu¬ 
tive  bits  where  . . . ,Xn-i)  =  (yo,yi,---,y-y-i)-  For  example,  in  B4, 

dD(0001, 1001)  =  4  —  1  =  3  since  the  longest  string  shared  by  the  last  consecutive 
bits  in  0001  and  the  first  consecutive  bits  in  1001  is  the  single  bit,  1.  It  follows 
immediately  that  the  directed  distance  between  any  two  vertices  in  B„  is  at  most  n. 

The  distance  d(x,y)  between  x  and  y  in  Bn  is  n  —  C  where  ^  =  max(7,«,c) 
and 


•  K  is  the  longest  string  of  consecutive  bits  satisfying  the  equality 

(yn— yn— <t+l )  •  •  •  1  yn— 1)  —  II?  •  •  ■  »  l)* 

•  c  =  2(n  —  u)  where  1/  is  the  longest  string  of  consecutive  bits  shared  by  both 
X  and  y. 

•  7  is  defined  above. 
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In  B4,  for  example,  </(0001, 1001)  =  4  —  max(l,0, 2)  =  2.  This  observation 
sOlows  for  a  simple  way  to  find  the  edge-distance  de{v,C)  in  Gb„  between  a  vertex  t; 


and  a  cycle  C  generated  from  the  PCRn+i. 

Lemma  rV.26  Let  F  be  the  PCRn+i  edge-factor  of  Bn.  Let  S  be  the  distance 
from  a  vertex  u  €  V  to  a  cycle  C  in  F .  Then  there  exists  a  directed  path 
between  v  and  x  €  {C)v  of  length  S  in  Bn- 

Proof:  Follows  directly  from  the  discussion  above.  ■ 

Lemma  IV. 27  Let  F  be  the  edge-factor  of  Bn  generated  by  PCf2n+i, 

u  =  00 . . . Oil . . .  1.  and  a  =  \C\  for  some  C  €  F.  Then  d{v, C)  >  n  —  a. 

"1^  r?i 

Proof:  If  C  is  a  short  cycle  in  PCRn+i,  i.e.,  jCj  =  a  <  (n  -t- 1)  and  (r|(n  -f  1), 
then  at  most  <7  bits  of  any  vertex  in  (C')„  coincide  with  any  cyclic  shift  of  the 
n-tuple,  V  =  00...011...1.  Therefore,  the  shortest  path  from  u  to  C  is  of 
length  at  least  n  —  <7.  ■ 

From  Theorem  IV. 25  we  see  that  the  PCRn+i  edge-factor  possesses  the  prop¬ 
erties  that  should  minimize  the  sum  of  the  squares  of  the  cycle  lengths.  From 
Lemma  IV.27,  we  find  that  in  the  PCRn-»-i  edge-factor  the  lower  bound  on  the  dis¬ 
tance  between  larger  length  cycles  and  the  initied  vertex  v  =  00 ...  OJ 1 ...  1.  is  always 

Lfj  r?i 

smadler  than  the  lower  bound  on  the  distance  between  smaller  cycles  and  v. 
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The  following  two  theorems  insure  that  the  PCR„+i  cycles  in  Bn  are  irre¬ 
ducible.  Proofs  of  the  theorems  can  be  found  in  Mykkeltveit  [Ref.  20]. 

Theorem  IV.28  The  maximum  number  of  adjacencies  between  tut  arbitrary 
cycles  in  the  PCRn+i  is  2. 

Theorem  IV.29  There  cannot  be  more  than  one  adjacency  between  a  cycle 
with  submaximal  length  (<  n  1)  and  any  other  cycle  in  the  PCRn+i-  Two 
cycles,  both  of  which  have  submaximal  length,  can  not  have  any  adjacencies. 

The  ideas  and  concepts  developed  in  this  section  and  an  exhaustive  seau'ch  of 
all  double  Eulerian  walks  in  1  <  n  <  5  support  the  following  conjecture: 

Conjecture  IV.30  Let  the  PCRn+i  cycles  be  the  edge-factor  F  of  the  digraph 

Bn  and  u  =  00 ...  0  11 ...  1  the  inititd  vertex  in  a  double  F  rian  walk  produced 
IfJ  Ffl 

by  Algorithm  D.  Then  W„  generates  the  minimum  measure  over  all  double 
Eulerian  walks  on  Bn- 

The  PCRn+i  factor  can  be  constructed  in  zui  efficient  fashion  using  the  0  Al¬ 
gorithm  of  Fredricksen  tind  Kessler  [Ref.  26).  Table  VI  provides  the  measure  of  the 
double  Eulerian  cycle  W2,  generated  by  Algorithm  D  using  the  edge-factor  of  the 

PCRn+i  and  v  =  00 . . .  0.1 1 . . .  1  for  1  <  n  <  11.  The  numbers  for  1  <  n  <  4  were 
L?j  rfi 

found  independently  by  Harper  [Ref.  27]. 
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n 

value 

1 

8 

2 

24 

3 

72 

4 

200 

5 

524 

6 

1,400 

7 

3,420 

8 

8,352 

9 

19,476 

10 

45,232 

11 

101,722 

Table  VI.  Conjectured  values  for  1  <  n  <  11 


Figure  17.  The  Good  -  de  Bruijn  digraph  with  a  cycle  generated  by  Algorithm  D 
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It  turns  out  that  the  double  Eulerian  cycles  generated  by  Algorithm  D  using 

the  edge-factor  F  of  the  PCRn+i  and  v  =  00. . .  OJl . . .  1,  are  not  the  only  walks  of 

ifj 

minimum  measure  in  Bn-  In  Figure  17,  for  Example,  a  double  Eulerian  W2iik 

oio,  On,  Os,  ag,  oio,  ou,  cis,  a^,  Ois,  Oij,  an,  oj4,  ais,  an,  ais,  an ) 

is  generated  by  Algorithm  D  using  the  edge-factor  of  PCR4  and  v  =  001,  By  an 
exhaustive  search,  we  know  this  walk  has  the  minimum  measure  of  72  for  any  double 
Eulerian  cycle  on  B3.  Wi  was  generated  using  cycles  of  lengths  4,4,4,2,1,  and  1.  The 
double  Eulerian  cycle 

=  (0i,a2,a2,a3,a4,as,ai,a3,a4,a6,a7,  as,  07,05,03,09, 

Ol4,  Ol5,  On,  Og,  09,  0x4,  On,  Ois,  015,  O12, 013,  OlO,  O12,  Oi3,  On,  On) 

also  has  a  measure  of  72  and  uses  cycles  of  lengths  5,4,3, 2,1,  and  1.  Thus,  we  see 
that  walks  of  minimum  measure  are  not  restricted  to  the  PC /?n+i  edge-factor.  We 
have  not,  however,  found  an  edge-factor  in  Bn  where  the  length  of  the  leu-gest  cycle 
is  greater  than  n  4-  2  that  also  generates  a  walk  of  minimum  measure  in  B„. 
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V 


A  RANDOMNESS  PROPERTY  OF  DE 
BRUIJN  CYCLES 

“Living  backwards,”  Alice  repeated  in  great  astonishment.  “I  never  heard  of 
such  a  thing.  ” 

“...but  there’s  one  great  advantage  in  it,  that  one’s  memory  works  both  ways.” 

“I’m  sure  mine  only  works  one  way,  ”  Alice  remarked.  “I  can’t  remember  things 
before  they  happen.  ” 

“It ’s  a  poor  sort  of  memory  that  only  works  backwards,  ”  the  Queen  remarked. 

Through  the  looking  glass,  Chapter  5 


A.  INTRODUCTION 

A  strong  statistical  dependence  exits  between  the  predecessor  and  successor  of 
each  state  of  a  binary  feedback  shift  register  (FSR).  The  consequences  of  randomly 
selecting  each  successor  state  from  a  choice  of  two  possible  states  proves  have  interest¬ 
ing  consequences.  Selecting  the  successor  state  actually  can  modeled  a  rather  simple 
Msurkov  process,  since  the  two  possibilities  can  be  assigned  probabilities.  We  show 
that  the  distribution  of  runs  in  a  binary  de  Bruijn  cycle  coincides  with  the  expected 
distribution  of  runs  for  a  binary  sequence  of  length  2"  generated  by  randomly  selecting 
the  successor  state  in  a  binary  FSR. 

B.  THE  BALANCE  PROPERTY 

A  binary  sequence  S  is  said  to  have  the  balance  property  if  the  number  of  I’s 


in  S  equals  the  number  of  O’s  in  5  [Ref.  6]. 
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Theorem  V.l  A  de  Bruijn  cycle  has  the  balance  property. 

Proof:  A  de  Bruijn  cycle  of  length  2"  has  the  property  that  every  n  consecu¬ 
tive  bits  are  different  on  a  given  period.  Expressed  in  decimal  notation,  each 
n-tuple  can  be  thought  of  as  representing  an  integer  from  0  to  2"  -  1.  In  this 
range  there  are  2’*“^  odd  integers  and  2"~^  even  integers.  Thus,  a  de  Bruijn 
cycle  contains  2"~*  I’s  and  2”“*  O’s  and  will  therefore  always  possess  the  bsJ- 
ance  property.  ■ 

By  a  simple  counting  argument  we  find  that  the  size  of  the  set  of  all  balanced 
binary  sequences  of  length  2"  is  where  =  C(n,r)  is  the  binomial  coefficient 

of  n  things  taken  r  at  a  time.  A  de  Bruijn  cycle  can  be  normalized  by  requiring  that 
the  sequence  begin  with  exactly  n  consecutive  O’s.  There  are  then  normalized 

sequences  of  length  2”  that  possess  the  balemce  property.  This  follows  since  the  n 
consecutive  O’s  must  be  preceded  and  followed  by  a  1. 

For  example,  when  searching  the  2®-long  binary  sequences  for  the  2,048  de  Bruijn 
cycles,  there  are  =  60,108,390  sequences  of  length  2®  that  possess  the  balance 
property.  There  are,  however,  only  =  4,457,400  normalized  balanced  se¬ 

quences  of  length  2®. 
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C.  THE  RUN  PROPERTY 

A  binary  sequence  5  is  said  to  have  the  run  property  if  among  the  runs  of 
ZEROs  (and  ONEs)  in  5,  one-half  of  the  runs  au-e  of  length  one,  one-fourth  are  of 
length  two,  one-eighth  are  of  length  three,  and  so  on,  as  long  as  these  fractions  provide 
an  integer  value  for  the  number  of  runs. 

Theorem  V.2  A  de  Bruijn  cycle  has  the  run  property. 

Proof:  The  run  structure  of  a  de  Bruijn  cycle  5„  can  be  completely  deter¬ 
mined.  A  run  of  ZEROs  of  length  n  occurs  exactly  once  in  5„.  This  run 
of  ZEROs  of  length  n  must  be  preceded  and  followed  by  a  1,  or  the  n-tuple 
(00  ’OO)  would  appear  at  least  twice  in  5n.  The  n-tuple  (100...0)  of  a  1 
followed  by  (n  —  1)  O’s  occurs  exactly  once  in  the  sequence.  This,  however,  is 
already  2iccounted  for  by  the  run  of  ZEROs  of  length  n.  Thus,  there  is  no  run 
of  ZEROs  of  length  n  —  1.  To  find  the  number  of  runs  of  ZEROs  of  length  fc, 
for  1  <  k  <  n  —  2,  yve  consider  ail  n  consecutive  terms  of  the  sequence  that 
begin  with  a  1,  then  the  /c-tuple  (00. . .  0),  and  then  a  1.  Each  such  run  can  be 
made  to  correspond  to  an  n-tuple  t  of  the  form 

t  =  1 QO . . .  0.1  XX . . .  X., 

k  n—k—2 

where  the  x’s  are  chosen  as  arbitrary  bits.  Since  we  are  free  to  choose  each  of 
the  remaining  n  —  k  —  2  components,  there  are  runs  of  ZEROs  of  length 
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n  Run  Distribution:  D„ 

1  m 

2  {2} 

3  {1,3} 

4  (1,1, 2, 4} 

5  {1,1, 1,1, 2,2, 3, 5} 

6  {1,1, 1,1, 1,1,1, 1,2, 2, 2, 2,3, 3, 4, 6} 

Table  VII.  Distribution  of  runs  in  5„ 

A:  for  1  <  k  <  n  —  2.  With  the  single  run  of  ZEROs  of  length  n,  the  result 
follows.  The  same  distribution  holds  for  runs  of  ONEs.  ■ 

The  multiset  of  the  lengths  of  the  runs  of  ZEROs  (ONEs)  in  a  2" -long  de  Bruijn 
sequence  is  denoted  by  Dn-  Table  VII  is  a  list  of  the  ZERO  and  ONE  run  distribu¬ 
tions,  Dn,  for  a  sequence  Sn  €  Sn,  where  1  <  n  <  6.  Any  sequence  of  length  2" 
whose  run  distribution  is  a  permutation  of  the  multiset  has  the  run  property.  We 
alternatively  indicate  such  multisets  by  specifying  the  number  of  times  each  different 
type  of  element  occurs.  Thus,  D®  can  also  be  denoted  by  {4  •  1, 2  •  2, 1  •  3, 1  •  5}  where 
4,  2,  1,  and  1  are  the  repetition  numbers  of  the  entries  1,2,3,  and  5,  respectively.  The 
repetition  number  of  each  element  k  €  D„  is  denoted  by  ro^ik),  where 

2n-k-2  ifl<ifc<„_2 

ronik)  =  (V.l) 

1  if  fc  =  n. 

A  binary  sequence  can  be  interpreted  as  a  sequence  of  integers  representing 
the  lengths  of  the  alternating  runs  of  ZERO'S  and  runs  of  ONE’s  in  the  sequence.  We 
have  seen  that  a  binary  2’*-long  sequence  5  corresponds  to  the  run  sequence  R  [See 
Chapter  III,  Section  C].  The  run  sequence  R  consists  of  the  subsequence  Zn  of  the 
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lengths  of  the  runs  of  ZERO’S  interleaved  with  the  subsequence  0„  of  the  lengths  of 
the  runs  of  ONE’s.  The  sequences  £„  and  On  for  any  de  Bruijn  cycle  5„  of  length  2" 
will  always  be  permutations  of  £)«  because  of  the  run  properties  of  5„. 

In  an  arbitrary  binary  sequence  of  length  2”  possessing  the  run  property,  there 

(20,2>,2*,.l2"-s)  ~  2<>'!2>t2^!.!.2"-*!  ZEROs  may  be  arranged. 

We  can,  however,  normalize  the  binary  sequence  (i.e.,  let  it  begin  with  the  string  of  n 

consecutive  ZEROs),  and  then  there  are  only  (j,  *  ji-s)  ways  to  position  the  2"“* 

runs  of  ZEROs.  Hence,  there  axe 

(  2-2  )(  2-2-1  \ 

V20,2S22, . . . ,2"-V  V2»,22, . . . ,2-3^ 

normalized  binary  sequences  of  length  2"  possessing  the  run  property  where  the  first 
factor  counts  the  runs  of  ONEs  and  the  second  factor  counts  the  runs  of  ZEROs. 

As  an  example,  in  a  normalized  binary  sequence  of  length  2^  possessing  the 
run  property,  there  axe  =  105  ways  that  the  runs  of  ZEROs  may  be  arranged. 
The  runs  of  ONEs  can  be  independently  arranged  in  (1*4)  =  840  ways.  Hence,  the 
2,048  de  Bruijn  cycles  of  length  2®  are  a  subset  of  the  105  x  840  =  88, 200  normalized 
binary  sequences  possessing  the  run  property. 

Clearly,  not  every  permutation  of  Zn  and  C?„  generates  a  de  Bruijn  cycle. 
Interestingly,  there  is  no  de  Bruijn  cycle  of  length  2®  with  either  Z^  o£  O5  equal  to 
(51312121),  (51121132),  (51123211),  (51212131),  (51213121),  (52113112),  or 
(52311211).  This  begs  the  question  of  why  specific  permutations  of  D„  cannot  occur 
as  Zn  or  On  in  a  de  Bruijn  cycle  that  is  answered  in  the  following  section.  (Note: 
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When  Sn  viewed  as  a  cycle,  care  must  be  taken  not  to  break  the  run  sequences 
across  a  run  when  the  sequence  is  cycled.) 

D.  A  RANDOMNESS  PROPERTY 

A  binary  sequence  of  length  2"  having  the  balance  and  run  property  is  not 
necessarily  a  de  Bruijn  cycle.  For  example,  the  balanced  sequence  0000111100110101 
has  the  run  property,  but  clearly  this  is  not  a  de  Bruijn  cycle.  We  show  that  if  the 
run  sequences  and  0„  of  a  balance  binary  sequence  with  the  run  property  are 
arrsmged  randomly,  the  resulting  binary  sequence  is  de  Bruijn.  We  define  what  we 
mean  by  randomly  in  the  sequel. 

The  simplest  possible  non-trivial  experiment  is  one  that  may  result  in  either  of 
two  possibilities.  Such  an  experiment  is  called  a  Bernoulli  trial  and  the  two  outcomes 
are  labeled  as  1  or  0  (success  or  failure).  This  framework  is  used  in  what  follows. 

Let  A  and  B  be  copies  of  the  multiset  /?„.  Suppose  the  elements  from  the 
multiset  A  are  viewed  as  being  placed  randomly  between  the  positions  of  another 
circular  permutation  of  the  second  multiset  B.  The  interleaving  of  A  and  B  generates 
a  run  sequence  R.  (We  view  A  as  the  multiset  of  the  lengths  of  the  runs  of  ZEROs 
smd  B  as  the  multiset  of  the  lengths  of  runs  of  ONEs.)  Given  an  arbitrary  element  j 
in  B,  let  Xi  be  a  binary  indicator  variable  defined  on  j  satisfying  =  1  if  and  only 
if  an  element  1  from  A  is  to  the  immediate  right  (TTIR)  of  j,  otherwise  Xi  =  0.  The 
probability  function  for  the  discrete  random  variable  Xi  is 

PXiix)  =  P{Xi  =  x),  for  X  €  {0, 1}. 
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Since  the  element  1  €  A  accounts  for  2’*”^  of  the  2"“®  elements  in  D„,  it  follows 
immediately  that  p;f,(l)  =  .5  =  p;r,(0).  The  expected  value  for  Xi  is 

E[X,]=  5:  xpx.(x)  =  0(.5)  +  l(.5)  =  (.5). 

x€{0,l} 

The  expected  value  of  Xi  is  the  probability  that  a  1  €  >1  is  TTIR  of  an  element  in 
B.  We  know  that  the  expected  value  of  a  sum  is  the  sum  of  the  expected  values.  The 
expected  number  of  I’s  of  A  that  are  TTIR  of  all  of  the  entries  j  in  B  is  denoted 
=  2>"Dn(j)>  where  ro^{j)  is  the  repetition  number  of  the  element  j  in  the 
multiset  D„.  From  Equation  V.l,  it  follows  that 

2n-j-3  if  1  <  j  <  n  -  2 

E[\;j]  =  -  (V.2) 

2-‘  ifi  =  n. 

£^[1;  j]  is  an  integer  except  when  j  is  either  (n-2)  or  n.  We  find  it  necessary  to  specify 
the  expected  number  of  I’s  that  are  TTIR  of  (n  —  2)  pi  TTIR  of  (n)  in  B,  denoted 
jF[1;  n  -  2,  n],  so  that  an  integer  value  is  realized.  We  then  find  E[l\  n  -  2,  n]  =  1. 

Similarly,  we  define  X2  on  j  €  B  to  be  a  binary  indicator  variable  where 
X3  =  1  if  and  only  if  a  2  in  the  multiset  A  is  TTIR  of  j,  otherwise  Xj  =  0.  It  follows 
immediately  that  Pxa(l)  =  -25,  Px2(0)  =  .75  and  E[X2]  =  0.25. 

The  expected  number  of  2’s  that  are  TTIR  of  all  the  j’s  in  B,  denoted  £^[2;  j] 
is  equal  to  (.25)r/)„(j).  From  Equation  V.l,  it  follows  that 

{2"--'-“  if  1  <  j  <  n  -  2 

(V.3) 

2~^  if  j  =  n. 
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In  general,  we  define  the  random  variable  Xk  on  j  €  B  to  be  a  binary  indicator 
variable  where  X*  =  1  if  and  only  if  a  is  TTIR  of  j,  otherwise  Xk  =  0.  The 
probability  function  for  Xk  is  given  as 

2-*  if  1  <  it  <  n  -  2 

Px,{l)  =  (V.4) 

2-*+2  ifjt  =  n, 

and  Pxk(O)  =  1  -  Px„(i)-  The  expected  value  of  Xk  is  given  by 

{2-*  if  1  <  it  <  n  -  2 

(V.5) 

2-*+’  iflt  =  n. 

The  following  Lemma  provides  the  expected  number  of  elements  TTIR  of  each 
type  of  element  when  random  permutations  of  Dn  are  interleaved. 


Lemma  V.3  If  elements  from  A  =  Dn  and  B  =  Dn  are  placed  randomly  in 
alternate  order,  TTIR  of  all  the  j*s  in  B  we  expect  to  find  exactly  E[k,j]  k’s 
in  A,  with 


Elk,j\  = 


2*1— i— fc— 2 

if  1  <  j,k  <  n  —  2 

2-* 

if\<k<n  —  2  and  j  —  n 

2-j 

ifk  =  n  and  1  <  j  <  n  —  2 

2-*+2 

ifkj  =  n 

(V.6) 


and  where  the  multiset  Dn  represents  the  run  distribution  for  a  2^ -long  binary 
sequence  possessing  the  run  property. 

Proof:  Follows  directly  from  the  Equations  V.l  and  V.5  ■ 
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We  define  the  multiset 


/?!  =  {£(1;  1] .  1,£[2;  1]  •  2,...,£[n  -  3;  1]  •  n  -  3}, 

where  Ri  is  the  submultiset  of  integers  of  A  that  represent  the  elements  that  are 
expected  to  be  TTIR  of  all  the  I’s  in  B.  Recall  that  E[k;j]  =  E[Xk\rD„{j)-  Therefore, 

Rx  =  •  l,E[X2]rD„il)  •  2, .  • . ,  £[A:„.3]ri,„(l)  •  n  -  3}. 

In  like  manner,  Rj  =  {^^[1;  2]  •  1,  E[2;  2]  •  2, . . . ,  £J(n  —  4;  2]  •  n  —  4}. 

We  define  the  multiset  Rk  =  {£[l;fc]-l,  E[2]  fc]-2, . . . ,  E[n  —  k  —  2‘,  fcj-A:},  where 
1  <  k  <  n  —  2.  The  multiset  Rk  is  the  submultiset  of  integers  of  A  that  represents 
the  elements  that  are  expected  to  be  TTIR  of  all  the  fc’s  in  B. 

In  general,  let  a  be  a  submultiset  of  B  and  define  Rq  to  be  the  multiset  of 
integers  that  represents  the  elements  in  A  that  are  expected  to  be  TTIR  of  all  the 
elements  in  a.  If  or  and  are  multisets  in  B,  Ro  =  Rp  if  and  only  if  |a|  and  |/3|. 

In  a  similar  manner,  we  denote  the  multiset  of  integers  that  represent  the 
expected  elements  from  A  that  are  TTIR  of  all  the  ji's,  jVs,  jq's,  in  B  by 
.i*)'  define  Rf  =  R(k+i,k+2,...,n-2,n)~ 

Lemma  V.4  Rjf  =  Rk  for  every  I  <  k  <  n  —  2 

Proof:  We  need  to  show  that  |/:|  =  |fc|.  By  Equation  V.l,  ronik)  =  2"“*'^ 
for  1  <  fc  <  n-2  and  rD„(A:  +  l)  + - l-r£)„(n-2)  +  ru„(n)  =  2*)  + 1. 

n-fe-3 

Let  2*  =  5.  Then  2S  —  S  =  S  =  2"“*"^  —  1.  It  follows  immediately  that 

k=i 

»*£)„(*;)  =  rD„(k  +  1)  +  •  •  •  +  rc„(n  -  2)  +  rD„in).  Therefore,  Rj  =  R*.  ■ 
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As  an  exsnnple,  let  be  a  binary  sequence  whose  run  sequence  consists  of 
alternating  terms  from  two  respective  multisets  that  have  been  randomly  inter¬ 
leaved.  Let  be  the  sequence  of  run  lengths  of  ZEROs  and  C?6  the  sequence  of  run 
lengths  of  ONEs.  We  would  expect  the  following: 

1.  Exactly  four  I’s,  two  2’s,  and  one  3  in  are  TTIR  of  the  I’s  in  Oe- 

Ri  =  {1,1, 1,1, 2, 2, 3}. 

2.  Exactly  two  I’s  and  one  2  in  Z^  are  TTIR  of  the  2’s  in  Oe.  R2  =  {1, 1,2}. 

3.  Exactly  one  1  in  Z^  is  TTIR  of  the  3’s  in  Os-  R3  =  {1}. 

4.  Exactly  one  1  in  A  is  TTIR  of  either  the  4  or  6  in  Oe-  R(4.6)  =  {!}• 

Similarly,  we  may  continue  to  determine  the  expected  values  of  the  elements  that 
must  be  TTIR  of  the  elements  in  the  multisets  Ri  and  Rj.  For  example,  we  define 
the  multiset 

Ri/i  =  {£^lA’i]£:[l;  1]  •  1,  E[Xi]E[2;  1]  •  2, . . . ,  E[Xi]E[n  -  4;  1]  •  n  -  4}, 

where  Ri/i  is  the  multiset  of  integers  that  represents  the  elements  expected  TTIR  of 
all  the  I’s  in  Ri. 

In  general  we  define 

£’|2;A-,i  -2 . 

(£1X,1)‘-'  £|p;X,|  p,}, 

where  p  =  n  —  xi  —  xj  -  —  Xk  —2.  The  range  of  values  for  p  insures  integer  values 

for  the  representative  numbers  in  Rxii.../xk-  I*  follows  from  Equation  V.6  that 

Rx„/.../xi  =  {1}  when  xi  +  --  +  Xm  =  n-3.  (V.7) 
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From  Equation  V.6,  one  sees  immediately  that  E[k,j]  =  E[j,k].  Therefore,  since 
multiplication  is  commutative, 


Exi/.../xk  ^i/'lvk 

where  yi, . . .  y*  is  any  permutation  of  ii, . . .  x^.  Hence, 

E.Xi/.../Xk  ~  ^(Xi+  -+**)-  (^•^) 

Continuing  the  above  example,  we  also  expect  to  find  the  following  in  E^: 

1.  Exactly  two  I’s  and  one  2  in  Oe  are  TTIR  of  the  I’s  in  Ri.  Ri/i  =  {1, 1,2}. 

2.  Exactly  one  1  in  is  TTIR  of  the  2’s  in  i?i.  R1/2  =  {!}• 

3.  Exactly  one  1  in  Oe  is  TTIR  of  the  I’s  in  R2.  R2/1  =  {!}• 

This  process  can  continues  until  R*, =  {1}.  This  will  occur  when 
xi  + - 1-  Xm  =  n  -  3. 

Concluding  the  example,  we  also  expect  exactly  one  1  in  Oe  TTIR  of  the 
two  I’s  in  Ri/i,  therefore,  Ri/i/i  =  {1}.  Furthermore,  since  Ri  =  {1, 1, 1, 1,2,2,3}, 
it  follows  that  Rj  =  R(j, 3,4,6)  is  also  equal  to  {1, 1, 1, 1,2,2,3}.  In  a  like  manner, 
Rj  ~  R(3,4,6)  —  {I?  Ii2},  etc. 

A  binary  sequence  whose  run  structure  consists  of  alternating  terms  from  D„ 
with  the  expected  distribution  of  elements  to  the  immediate  right  and  left  of  each  like 
of  element  in  Z?„,  is  said  to  have  the  Expected  Value  Property. 
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A  binary  sequence  of  length  2”  with  the  Expected  Value  Property  exhibits  the 
following: 


2^ 


%  =  fill 


fij  =  {l,...,l,2,...,2,...(n-5),(n-4)}, 


2«— s  2**“®  2® 


/?2  =  fiji 


flfc  =  1, 2, . . . ,  2, . . .  (n  -  (fc  +  3)),  (n  -  (A:  +  2)}, 

2n-(*+3)  2"-(*+<)  2 

fifc  =  fife 


f2„-3  =  {1},  and 

fiji— 3  “  fi(n-2,n)  “  f^n— 3* 

Equations  V.7  and  V.8  have  far-reaching  implications.  In  particular,  they  provide  a 
connection  between  the  Expected  Value  Property  and  de  Bruijn  Sequences. 

Theorem  V.5  A  binary  sequence  S  of  length  2”  is  a  de  Bruijn  cycle  if  and 
only  if  the  run  sequences  of  S  possess  the  Expected  Value  Property. 

Proof;  Let  5  be  a  binary  sequence  with  the  Expected  Value  Property.  Since 
5  consists  of  alternating  terms  from  two  copies  of  Dn  it  follows  immediately 
that  S  has  length  2". 
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Let  be  the  sequence  formed  by  cyclically  shifting  S  by  j  bits  to  the  left.  The 
sequence  S  has  2"  distinct  n-tuples  if  and  only  if  for  every  j,  1  <  j  <  2",  the 
sequence  formed  by  the  sum  ©  S  does  not  contain  a  string  of  n  consecutive 
O’s. 

Let  S  possess  the  interleaved  run  sequences  On  and  2„,  each  of  which  is  a 
separate  permutation  of  D„,  for  1  <  j  <  2".  Let  T  be  the  longest  consecutive 
string  of  O’s  from  the  sum  sequence  5-'  ©  S.  The  length  of  T  is  ITI  =  t.  Let 
R  =  (ri,r2, . . .  and  W  =  .  .,Wt-i,Wt)  be  the  consecutive  bits 

from  2ind  5,  respectively,  such  that  R@W  =  T. 

Since  T  is  the  longest  consecutive  string  of  zeros,  the  bit  tq  that  immediately 
precedes  R  and  the  bit  wq  that  immediately  precedes  W  must  differ.  Without 
loss  of  generality  let  ro  =  ri  =  1.  Define  k  by  wi  —  W2  =  -  •  •  =  Wk-i  =  Wk  but 
Wk  ^  Wk+i.  Then  k  is  the  length  of  a  run  of  ONEs  in  W. 

We  now  show  by  contradiction  that  the  sequences  5  has  no  repeated  n-tuples. 
Since  the  run  lengths  in  5  are  elements  in  D„,  fc  <  n  —  2,  we  know  that 

R{kMi . n-2,n)  =  Rk-i-  With  A:  —  1  <  n  —  3,  the  properties  on  page  92  and 

equation  V.8  can  be  used.  Let  xi,X2,..  .,Xm  be  the  lengths  of  the  runs  that 
coincide  after  the  bits  r*  and  tufc.  Let  Xp,  where  p  <  m  be  the  length  of  the 

last  runs  in  R  and  W  that  concide,  where  (A:  —  1 )  -I-  xi  H - f-  Xn  <  (”  ~  3)  and 

(A;  — l)-t-xi-f - l-Xp+Xp+i  >  (n  — 3).  Let  (Ar  — l)-f-xiH - |-Xp  =  n  —  d,  where 
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>  3.  Since  Rk~ilxxi.../xp  =  Rn-d-,  it  follows  that  Xp+i  >  d  —  Z.  The  length  of 
the  longest  run  that  coincides  between  R  tmd  W  TTIR  of  Rn-d,  however,  is 

d— 3.  We  find,  therefore,  that  (/c—l)+iiH - \-Xm  =  n—g  <  n  —  Z.  The  length 

of  the  longest  string  of  bits  that  coincides  TTIR  of  Rn~g,  however,  is  g  —  2. 

Since  (fc— l)+xiH - VXm  =  n-g,  it  follows  that  fc-fXiH - |-im+5-2  =  n-l 

is  the  length  of  the  longest  string  of  bits  that  coincide.  Therefore,  S  has  no 
repeated  n-tuples  and  is  therefore  a  de  Bruijn  cycle. 

Conversely,  let  5  be  a  de  Bruijn  cycle.  Since  S  is  de  Bruijn,  it  consists  of 
alternating  terms  from  D„  and  5  has  the  Run  Property.  Consider  the  runs 
of  ONEs  of  length  r  where  0  <  r  <  n  —  2.  Each  such  run  can  be  made  to 
correspond  to  an  n-tuple  along  the  sequence  5  of  the  form 

0,11  •  •  •  •  •  X, 

r  n-r-2 

where  the  x’s  are  arbitrary  bits.  The  number  of  runs  of  ZEROs  of  length  j 
where  n  —  r—j  >  0  that  are  TTIR  of  the  runs  of  ONEs  of  length  r  is 
This  coil. -ides  with  the  expected  integer  values  given  by  Lemma  V.6.  Hence, 
the  Expected  Value  Property  for  S  follows  immediately  from  the  Run  Property 
and  the  uniqueness  of  the  n-tuples  in  a  de  Bruijn  cycle.  Therefore,  a  binary 
sequence  of  length  2"  with  the  expected  value  property  is  a  de  Bruijn  cycle,  ■ 
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Interleaving  any  permutation  of  the  run  sequences  of  Dn  for  1  <  n  <  3  gen¬ 
erates  a  de  Bruijn  cycle.  For  n  =  4  we  find  that  a  sequence  with  the  expected  value 
property  must  satisfy  the  following  condition  with  respect  to  its  run  sequences:  if  the 
sequence  of  runs  of  ONEs/ZEROs  has  2  consecutive  I’s  in  it  then  the  sequence  of  runs 
of  ZEROs/ONEs  cannot  have  2  consecutive  I’s.  This  condition  provides  a  very  simple 
way  to  generate  the  16  de  Bruijn  cycles  of  length  2".  There  are  3  different  normalized 
permutations  of  D4:  (4,2,l,l)v  (4, 1,1, 2),  and  (4, 1,2,1).  There  are  8  permutations  of 
D4  with  consecutive  I’s  :  (4,2,1,1),  (4,1, 1,2),  (2,4,1,1),  (2, 1,1,4),  (1,4,2,1),  (1,2,4,1), 
(1, 1,4,2),  and  (1, 1,2,4).  Finally,  there  are  4  permutations  of  D4  with  no  consecutive 
I’s:  (4,1,2,1),  (2, 1,4,1),  (1,4,1,2),  and  (1, 2,1,4).  The  16  de  Bruijn  cycles  are  generated 
by; 


1.  Interleaving  the  two  normalized  runs  sequences  from  D4  having  consecutive 
I’s  with  the  four  runs  sequences  from  D4  not  having  consecutive  I’s,  and 

2.  Interleaving  the  single  normalized  run  sequence  from  D4  having  no  consecutive 
I’s  with  the  eight  runs  sequences  from  D4  having  consecutive  I’s. 


Table  VIII  displays  these  sequences. 

In  the  general  case,  although  the  expected  value  property  can  be  used  to 
generate  de  Bruijn  cycles,  it  does  not  provide  a  more  efficient  means  to  construct 
them  than  appears  elsewhere. 
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4  2  11 

4,1, 2,1 

1111000011010010 

2, 1,4,1 

1111001101000010 

1,4,1, 2 

1111011000010100 

1, 2,1,4 

1111011001010000 

4, 1,1, 2 

4.1,2,! 

1111000010100110 

2,1, 4,1 

1111001010000110 

1, 4,1,2 

1111010000101100 

1,2,1, 4 

1111010010110000 

4, 1,2,1 

4,2,1,1 

1111000010011010 

4,1,1, 2 

1111000010110100 

2, 4, 1,1 

1111001000011010 

2,1,1, 4 

1111001010110000 

1,4,2, 1 

1111010000110010 

1,2,4,1 

1111010011000010 

1, 1,4,2 

1111010110000100 

1,1,2,4 

1111010110010000 

Table  VIII.  The  de  Bruijn  cycles  of  length  2*  generated  by  run  sequences 
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VI.  A  MEMORYLESS  ALGORITHM  TO 
GENERATE  DE  BRUIJN  SEQUENCES 


“To  change  and  change  for  the  better  are  two  different  things.  ” 

German  Proverb 


A.  INTRODUCTION 

In  this  chapter  we  present  an  algorithm  to  construct  a  subset  of  all  the  binary 
de  Bruijn  sequences  of  length  A  comprehensive  survey  of  previous  work  on  this 
subject  can  be  found  in  [Ref.  28].  A  common  approach  to  this  process  is  to  join 
cycles  together  to  form  a  full  cycle.  This  same  general  approach  will  be  followed  here. 
The  algorithm  presented  is  not  limited  to  joining  pure  cycles,  however,  using  the  PCR 
feedback  function  and  its  pure  cycles  produces  the  most  efficient  algorithm.  Unlike  the 
Universal  Algorithm  [Ref.  29],  the  representative  vertex  for  each  cycle  is  determined 
by  a  distance  function  on  the  digraph  Bn,  rather  than  by  finding  the  smallest  element 
among  all  the  vertices  on  a  cycle.  Our  algorithm  produces  an  Eulerian  cycle  in  B„. 
As  this  is  isomorphic  to  a  Hamiltonism  cycle  in  our  algorithm  allows  for  an 

interplay  between  vertices  and  arcs  that  provides  a  systematic  way  to  join  cycles. 
As  a  consequence,  we  decompose  the  Good  -  de  Bruijn  digraph  into  an  edge-factor 
to  insure  each  arc  is  traversed,  rather  than  into  a  factor  that  partitions  the  set  of 
vertices. 
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B.  BACKGROUND  FOR  SIMILAR  APPROACHES 

Golomb  [Ref.  6]  describes  the  key-sequence  method  to  generate  de  Bruijn  se¬ 
quences.  One  generates  the  set  of  span  n  de  Bruijn  sequences,  5„,  from  preference 
functions  (Pj*,  P,*)  that  are  obtained  from  feedback  formulas  and  from  the  preference 
functions  (PijPa)  for  the  span  n  —  1  sequences.  The  key-sequence  method  requires 
considerable  memory  storage  to  generate  the  sequences  because  every  de  Bruijn  se¬ 
quence  of  smaller  span  is  utilized  in  order  to  generate  the  de  Bruijn  sequences  of 
span  n. 

Fredricksen  [Ref.  30]  shows  how  to  generate  2^’*“®  full  cycles  of  length  2"  from 
the  prefer  ones  sequence  using  the  PC Rn.  The  method  uses  6n  bits  of  storage,  and 
n  units  of  time  to  produce  the  next  bit  from  a  given  n  bits. 

Etzion  and  Lempel  [Ref.  31]  show  how  to  generate  full  cycl«  of  length 

2“  from  the  PCRn  using  3n  -I-  {k)g(n,k)  bits  of  storage,  where  fc  is  a  constant  in  the 
range  1  <  fc  <  2^,g{n,k)  «  (n-21og  k)(l-  i+log  k^‘  time  required  to  produce 
the  next  bit  from  the  last  n  bits  is  0(n). 

C.  JOINING  OF  CYCLES 

Common  to  all  of  the  above  techniques  is  the  notion  of  cycle  joining.  When 
the  two  possible  predecessors  of  a  state  both  map  into  the  same  state  by  the  feedback 
function  /  we  say  /  is  singular.  Any  feedback  function  that  is  not  singular  is  therefore 
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nonsingular.  The  following  are  equivalent  conditions: 


1.  /  is  nonsingular. 

2.  /  produces  no  branches. 

3.  /  is  a  one  to  one  mapping. 

4.  /  is  an  onto  mapping. 

5.  Every  state  hzis  a  unique  predecessor  and  a  unique  successor. 

6.  /  produces  only  cycles. 

The  following  theorem  [Ref.  6]  is  stated  without  proof: 

Theorem  VI.  1  (Golomb)  A  necessary  and  sufficient  condition  for  distinct 

states  to  have  distinct  successors  is  that  the  corresponding  (nonsingular)  feed~ 

back  function  f:  {0, 1}  can  be  written 

f{Zot  2i,  • . . ,  Zn)  —  f\{,Z\y  Z2t  •  •  •  >  2^n)>  (^1*1) 

for  some  function  fi:  S"  {0, 1} 

Any  edge-factor  of  the  digraph  Bn  represents  a  feedback  function  that  satisfies 
equation  (VI.  1).  Furthermore,  any  feedback  function  satisfying  (VI.  1)  will  have  dis¬ 
tinct  successors  and  the  function  therefore  yields  an  edge-factor  in  the  corresponding 
Good  -  de  Bruijn  digraph. 

Since  Bn  is  a  2-regular  strongly  connected  digraph,  when  is  decomposed 
into  an  edge-factor  F  containing  two  or  more  cycles,  each  distinct  cycle  is  adjacent 
(i.e.,  arc  disjoint  and  sharing  a  common  vertex)  to  some  other  cycle.  By  the  following 
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lemma,  it  is  always  possible  to  join  together  two  adjacent  cycles  in  a  edge-factor  to 
form  a  single  cycle. 


Lemma  VI.2  Let  F  be  an  edge-factor  of  the  digraph  D.  Let  (R)  and  (T)  be 
distinct  adjacent  cycles  in  F  voith  x  €  {R)v  ^  €  (7’)v  The  cycles  (R)  and 
(T)  are  joined  into  a  single  cycle  when  the  respective  successors  of  the  vertex 
X  in  each  cycle  are  interchanged. 

Proof:  Let  the  following  sequence  of  vertices,  (R)  =  (ri,r2, . . . ,r/)  and 
(T)  =  (tj,  <2, . . . ,  tm),  represent  adjacent  cycles.  Since  the  cycles  are  adjacent 
there  is  a  vertex  x  that  lies  in  each  of  (R)u  and  (T)v.  Let  r,  =  tj  =  x.  The 
sequence  of  vertices  (n,  r2, . . . , n,  ij+2t  •  •  • » n+i » •  •  • » n)  created  by  ex¬ 
changing  the  successors  of  x,  i.e.,  exchanging  outgoing  arcs  of  x,  forms  a  single 
cycle.  ® 

To  illustrate  Lemma  VI.2  consider  the  PCR3  where  f{zo,z\yZ2)  =  zq  on  B2 
shown  in  Figure  18.  This  function  generates  the  edge-factor  of  Figure  19A.  The  cycles, 
represented  by  a  sequence  of  arcs,  are: 

1.  Cl  =  (0) 

2.  C2  =  (001) 

3.  C3  =  (Oil) 

4.  C4  =  (1) 
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Figure  18.  The  Good  -  de  Bntijn  digraph  B^  as  one  edge-factor 


Figure  19.  Edge-factors 
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The  cycles  Cj  amd  C3  contain  the  common  vertices  01  and  10.  By  Lemma  VI. 2 
a  single  cycle  is  formed  when  the  successors,  10  and  11,  of  the  vertex  01  are  inter¬ 
changed  between  the  cycles  C2  and  C3,  respectively.  The  cycles  now,  represented  by 
a  sequence  of  arcs,  are: 

1.  Cl  =  (0) 

2.  C2  =  (101001) 

3.  C^  =  (l) 

This  new  edge-factor  is  shown  in  Figure  19B.  Therefore,  the  problem  of  joining 
cycles  from  an  edge-factor  reduces  to  finding  the  vertices  common  between  auljacent 
cycles. 

To  construct  the  algorithm  that  generates  a  de  Bruijn  sequence,  a  representa¬ 
tive  vertex  for  each  cycle  plays  an  important  role  and  must  be  identified.  The  cycle 
representative  for  each  cycle  is  any  vertex  in  the  cycle  that  has  the  smsdiest  directed 
distance  from  a  designated  vertex  vd  in  Bn*  (The  designated  vertex  t;£7  is  am  arbitrary 
but  fixed  vertex  in  B„.)  The  cycle  representative  (C,)fl  of  the  cycle  (C,)  is  defined  to 
be  any  vertex  Vi  €  {Ci)^  such  that  for  the  given  designated  vertex  vp  €  V, 

dD{vD,Vi)=  min  dD(vD,u). 

If  there  is  more  than  one  vertex  on  a  cycle  of  minimum  directed  distance  from  vq,  we 
can  define  a  unique  representative  by  selecting,  say,  the  vertex  of  largest /smallest  dec¬ 
imal  value.  (Recall  that  the  directed  distance  doix,  y)  between  x  =  (xq,  xi,  *  •  • ,  x„_i) 
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and  y  =  (yo, yi,  •  •  • ,  Vn-i)  in  B„  is  n  —  7,  where  7  is  the  longest  string  of  consecutive 
bits  where  (xn— •y « Xn — n+i  1  •  •  •  1  ^n— 1 )  ~  (yo»  J/i*  •  •  •  1  J/'c— 1  )•) 

Lemma  VI. 3  shows  that  cycle  representatives  are  unique  to  a  cycle  unless  the 
representative  is  the  designated  vertex  vd- 

Lemma  VI.3  For  an  edge-factor  F  =  {CiyCj,. . .  ,Cm}  of  Bn  =  (V, i4),  let 
Ci  €  F  where  dD(vD,(^i)v)  >  0  and  vd  is  the  designated  vertex.  Then  there 
exists  a  cycle  Cj  €  F  adjacent  to  Ci  such  that  dD(uDy(Cj)t.)  <  (Ci)i;)- 

Proof:  The  proof  is  analogous  to  the  proof  of  Lemma  IV.  17.  ■ 

Corollary  VI.4  Every  vertex  that  is  the  representative  of  its  cycle  but  not 
equal  to  the  designated  vertex  vd  is  incident  with  exactly  two  cycles  in  any 
edge- factor. 

Proof:  This  follows  immediately  from  Lemma  VI.3.  ■ 

Lemma  VI.3  can  be  applied  with  distance  functions  other  th2m  the  directed 
distance  from  a  designated  vertex.  The  representative  vertex  of  the  cycle  C,  can 
be  defined  as  either  the  vertex  for  which  d£){(C,)/*,  vd,  )  =  niin„g(c^),  dD{{Ci)R,  vd)  or 
the  vertex  for  which  d{vD,  (Ci)^)  =  min„g(c.),  d{vD,  (C,)/*).  In  either  of  these  cases 
the  properties  of  the  representative  vertex  remain  the  same  and  the  concepts  in  the 
next  section  can  be  applied. 


103 


D.  THE  CYCLE  JOINING  ALGORITHM 

Consider  a  feedback  shift  register  with  n  +  1  stages.  The  tth  state  of  the 
FSRn^i  is  denoted  by  S,  =  Si+i, . . . , Sj+n).  Given  a  nonsingular  feedback  function 
/  on  the  register,  the  next  state  following  5,  =  (s,, s,+i, . . .  ,s,+„)  is  obtained  from 
the  Cycle  Joining  Algorithm  as  follows: 

Cycle  Joining  Algorithm 

Input:  A  feedback  function  /  for  a  FSRn^i,  the  current  state  5,  =  (s,,  s,+i, . . . ,  s,+„) 
of  the  register,  and  a  list  of  representative  vertices. 

Output:  The  next  state  Si^i. 

(Cl)  if  (s,+i, . . . ,  Sj+n)  designates  a  representative  vertex, 
then  Si+i  :=  /(Sj)  ©  1 
else  5i+i  =  fiSi). 

(This  cycle  joining  algorithm  is  zmalogous  to  others,  however,  it  was  originally  used 
by  Fredricksen  [Ref.  32].) 

We  can  apply  Lemmas  VI.2  and  VI.3  to  provide  a  simple  way  to  construct  a 
de  Bruijn  cycle  of  length  2"'*'^.  At  each  shift  of  the  FSRn+i  we  check  the  last  n  bits  in 
the  register.  The  n  bits  represent  a  vertex  in  If  the  vertex  is  a  cycle  representative 
and  this  is  the  first  occurrence,  we  expand  the  current  cycle,  by  joining  an  adjacent 
cycle  in  the  edge-factor  to  the  current  cycle.  If  the  vertex  is  not  a  cycle  representative 
we  continue  along  the  current  cycle.  If  the  vertex  is  a  cycle  representative  and  this 
is  the  second  occurrence,  we  close  the  current  cycle  and  enter  a  previous  cycle.  The 
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algorithm  constructs  an  Eulerian  walk  along  Bn  by  successively  selecting  cycles 
from  an  edge-factor  to  be  added  and  traversed.  This  is  accomplished  by  stepping  along 
a  nonsingular  shift  register  until  the  current  state  identifies  a  cycle  representative  at 
which  time  the  Cycle  Joining  Algorithm  is  executed  and  a  new  cycle  is  added. 

We  now  describe  an  algorithm  (Algorithm  H)  that  constructs  a  Eulerian  walk 
eilong  Bn  beginning  at  the  designated  vertex  vq-  Algorithm  H  adjoins  cycles  from  an 
edge-factor  F  to  create  the  Eulerian  walk.  This  algorithm  is  similar  to  other  common 
algorithms  [Ref.  29]  that  join  cycles  from  a  factor  cind  hence  create  a  Hamiltonian 
cycle  along  Bn-  Here  the  edge-factor  method  allows  us  to  use  the  properties  of 
distance  in  the  digraph  to  uniquely  identify  a  common  vertex  on  adjacent  cycles 
in  Bn-  Loosely,  speaking,  Algorithm  H  moves  forward,  entering  and  adding  cycles 
at  a  cycle  representative  of  increasingly  greater  distance  from  the  vertex  vd  for  as 
long  as  this  is  possible.  Each  cycle  representative  is  on  exactly  two  cycles.  Algorithm 
H  enters  a  cycle  representative  {D)r  for  the  first  time  adong  an  arc  on  a  cycle  C 
where  dD{vo,  {C)v)  <  d£j(uD,  The  algorithm  immediately  departs  the  vertex 

{D)r  along  an  arc  on  the  cycle  D.  When  the  cycle  representative  {D)r  is  entered  for 
the  second  time,  it  is  along  an  arc  on  the  cycle  D  and  the  vertex  {D)r  is  departed 
along  an  arc  on  the  cycle  C-  When  it  is  no  longer  possible  to  enter  a  new  cycle  at 
a  greater  distance,  the  algorithm  traverses  each  arc  on  the  last  adjoined  cycle  and 
then  backtracks  to  the  previous  cycle  at  its  cycle  representative.  The  backtracking 
continues  to  “close”  previously  entered  cycles  until  the  first  cycle  from  which  it  is 
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possible  to  again  go  forward  to  enter  and  add  new  cycles.  A  cycle  C  could  conceivably 
be  entered  at  several  places  along  the  walk.  Algorithm  H,  however,  does  not  allow 
a  cycle  C  to  be  entered  for  the  first  time  at  any  vertex  other  than  at  the  cycle 
representative  (C)^. 

ALGORITHM  II  [Constructs  an  Eulerian  walk  along  B„] 

Input:  A  Good  -  de  Bruijn  digraph  Bn  =  (V,  A),  a  designated  vertex  ud  €  V  where 
vd  =  (ui, U2>  •  •  • , Un)  €  B",  and  a  shift  register  with  a  nonsingular  feedback  function 
/  of  degree  n  +  1. 

Output:  A  sequence  (0,si,S2, . . . ,S2n+i_i)  constituting  an  Eulerian  cycle  through 
B„. 

Algorithm  H 

Initially  set  the  register  to  Si  =  (0, u*, t/2,  •  •  • ,  Wn)  =  (si,  S2, . . . , s„).  Step  the  register 
once  to  produce  S2  =  (^i,  S2, . . . , f{Si  01)),  and  set  i  =  3. 

HI  Si-1  =  (sj-i ,  Sj, . . . ,  Si+n)  lies  on  cycle  C.  We  examine  each  of  the  states 
on  the  cycle  until  a  state  is  found  that  identifies  a  vertex  of  greater  distance 
from  the  designated  vertex  than  the  current  state.  If  no  such  state  exists 
(i.e.  the  current  state  identifies  the  cycle  representative),  then  go  to  [Hi a], 
else  go  to  [Hlb]. 

Hla  Si  =  (s,_i,3,,...,Si+„_i,/(5<-i));  go  to  [H2]. 

Hlb  Si  —  (si— 1, Si, . . . , y(5j_i) 0  1)5  goto  [H2]. 

H2  Increment  i. 

H2a  If  i  <  2"'''^  go  to  [Hi];  else  end. 

We  now  show  that  the  walk  constructed  by  Algorithm  H  visits  every  edge  in 
Bn  exactly  once. 
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Corollary  VI. 5  Let  F  =  {Ci,C2,...,Cm}  be  an  edge-factor  of  Bn  =  {V,A) 
and  Vo  is  the  designated  vertex.  Furthermore,  let  vq  be  a  vertex  in  exactly  one 
cycle  in  F.  We  define  a  graph  Tb„  as  follows:  the  vertices  in  Tb^  o^re  the  cycle 
representatives  of  the  cycles  in  F,  and  there  is  an  edge  between  vertices  x  and 
y  in  Tb„  if  and  only  if  x,y  ^  (C'i)v  for  some  cycle  Ci  €  F.  Then  T  is  a  tree. 

Proof:  The  order  of‘7B„  is  m,  the  number  of  cycles  in  F.  By  Theorem  VI.4 
for  each  cycle  C,  €  F  where  (C'i)v)  >  0  there  exists  a  unique  cycle 

Cj  €  F  adjacent  to  C,  such  that  dD(vDi(C'j)v)  <  do{v,{Ci)vC).  Therefore, 

TSn  is  connected  and  the  size  of  the  edge  set  of  Tb„  is  m  —  1.  Hence,  Tb„  is  a 
tree.  In  like  manner,  in  the  case  where  vo  is  in  exactly  2  cycles  in  F,  Tb„  is 
also  a  tree.  ■ 

Thus,  given  an  edge-factor  F  of  the  digraph  Bn  =  {V,  A)  and  a  vertex  vp  €  V, 
Algorithm  H  joins  every  cycle  in  F  and  thereby  constructs  an  Eulerian  wzdk  along 
Bn  beginning  at  the  vertex  vo-  (Algorithm  H  is  in  fact  a  depth  first  search  of  TBn-) 
Since  TBn  connected,  every  arc  is  traversed  exactly  once. 

To  illustrate  the  algorithm,  consider  a  FSRs  and  the  pure  cycling  register 
defined  by  the  feedback  function  f{zo,zi,Z2,Z3,Z4)  =  zq.  Let  the  designated  vertex 
Vo  in  B4  be  0011.  The  edge-factor  contains  the  eight  cycles  and  their  respective 
rep  ’sentatives  as  shown  in  Table  IX.  (Note:  the  eilgorithm  does  not  compute  the 
table  of  representatives  a  priori,  rather,  representatives  are  computed  on  line.)  The 
cycle  (00101)  has  two  vertices  1001  and  1010  each  of  the  same  minimum  directed 
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Table  IX.  Cycles  and  representatives  for  a  PCRs,  where  vd  =  0011 


distance  from  vd-  We  arbitrarily  select  the  vertex  1010  to  be  the  representative,  as 
it  is  of  largest  decimal  value.  Applying  the  Cycle  Joining  Algorithm  to  this  FSR^, 
l.ie  cycles  are  joined  together  to  form  one  cycle,  shown  in  Figure  20.  The  de  Bruijn 
sequence  representing  this  cycle  is 

00011111011100110100101011000001. 

Changing  the  cycle  representative  of  (00101)  to  1001,  the  cycles  are  joined  to 
form  a  different  sequence 

0001 111101110010100110101 1000001 . 

Continuing  the  example  with  the  same  feedback  function,  when  the  designated  vertex 
is  changed  to  1001  auid  using  the  smallest  representative  0101  for  the  cycle  (01011)  a 
different  sequence  is  obtained 

01001111101110010110101000110000. 
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Finally,  with  vq  =  1001  and  selecting  0110  is  the  representative  of  the  cycle  (01011) 
we  get  the  sequence 

01001111101110010100011010110000. 

Each  of  these  sequences  are  different  (i.e.,  none  is  a  cyclic  shift  of  another).  It  is  not 
always  the  case  that  distinct  vp's  yield  different  sequences.  Clearly,  identical  sets 
of  representative  vertices  generate  identical  sequences.  The  size  of  the  class  of  de 
Bruijn  sequences  generated  by  the  pure  cycling  register  using  Algorithm  H  has  not 
been  determined.  Using  just  the  feedback  function  of  PCR4,  however,  12  of  the  16 
de  Bruijn  sequences  of  length  16  can  be  obtained  by  changing  the  vertex  vq- 

E.  THE  FEEDBACK  FUNCTIONS 

Using  the  PC  Rn,  the  Cycle  Joining  Algorithm  requires  storage  of  n  bits  for  the 
current  state  Si,  n  bits  for  cycling  the  current  state,  and  n  bits  to  store  the  current 
representative  on  a  cycle,  for  3n  bits  of  stor^e.  The  gre.-r<;est  amount  of  time  in  this 
algorithm  is  spent  to  determine  the  representative  vertex  on  each  cycle.  The  worst 
case  occurs  when  the  current  state  is  the  cycle  representative  and  the  entire  cycle 
must  be  traversed.  Therefore,  having  an  edge-factor  with  relatively  short  cycles  is 
desirable.  For  an  arbitreiry  nonsingular  feedback  function,  the  length  of  the  longest 
cycle  can  be  expected  to  be  very  large  [Ref.  6).  Jansen  et.al.  provides  an  approximate 
count  on  the  number  of  linear  functions  that  yield  cycles  whose  maximum  cycle  length 
is  4n  [Ref.  29]. 
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VII 


CONCLUSIONS 


‘^Results!  Why,  I  have  got  a  lot  of  results.  /  know  several  thousand 
things  that  won’t  work.  ” 


Thomas  Edison 


A.  SYNOPSIS  OF  THE  THESIS 

The  binary  le  Bruijn  cycle  provides  a  special  case  of  extracting  distinct 
n-tuples  from  a  2’*-long  binary  sequence.  Generalizing  the  de  Bruijn  property,  i.e., 
extracting  n  bits  that  are  not  necessarily  consecutive  along  the  cycle,  was  the  funda¬ 
mental  idea  motivating  this  work.  TLc're  appears  to  be  no  previously  published  work 
analyzing  the  set  of  complete  cycles,  a  set  that  includes  the  well-known  de  Bruijn 
cycles. 

In  Chapter  III,  the  framework  is  developed  to  systematically  comb  from  a 
sequence  every  n-tuple  appearing  at  successive  positions  along  a  subsequence  of  the 
sequence.  We  found  that  the  run  structure  for  all  but  one  class  of  complete  sequences 
is  completely  determined.  The  set  of  complete  sequences  with  m  <  n  consecutive 
teeth  defines  a  cycle  visiting  every  arc  an  equal  number  of  times  along  an  appropriate 
Good  -  de  Bruijn  digraph.  Only  the  case  for  m  =  n  —  1  is  thoroughly  discussed  in 
Chapter  III.  Chapter  III  concludes  with  a  probabilistic  argument  that  shows  that  the 
size  of  the  set  of  complete  sequences  for  a  specific  (/,  n)-comb  is  substantially  smaller 
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than  the  size  of  the  set  of  2"-long  de  Bruijn  cycles.  The  class  of  de  Bruijn  sequences 
is  the  largest  class  of  complete  sequences.  This  surprising  result  is  not  intuitively 
obvious,  given  the  randomness  properties  of  the  eligible  sequences. 

Chapter  IV  provides  a  comprehensive  survey  on  double  Eulerian  cycles  along 
graphs  and  digraph.  The  necessary  and  sufficient  conditions  for  a  graph  or  digraph 
to  have  a  double  Eulerian  cycle  are  given.  We  describe  a  class  of  double  Eulerian 
cycles  along  a  graph  (digraph)  that  minimizes  the  difference  of  visitation  times  along 
each  edge  (arc).  The  necessary  and  sufficient  conditions  are  found  to  generate  a 
minimum  double  Eulerian  cycle.  A  closed  formula  using  the  edge-status  of  a  graph 
is  shown  to  provide  the  value  of  a  graph.  An  algorithm  is  presented  that,  given 
the  appropriate  edge-factor,  generates  the  minimum  double  Eulerian  cycle  along  a 
digraph.  A  closed  formula  is  provided  for  the  value  of  a  Eulerian  digraph.  A  conjecture 
for  the  appropriate  edge-factor  to  generate  the  value  of  the  Good  -  de  Bruijn  digraph 
is  made. 

Chapter  V  provides  the  necessary  and  sufficient  conditions  for  the  permutation 
of  the  run  sequences  in  a  de  Bruijn  cycle.  We  see  that  the  structure  of  distinct 
n-tuples  is  a  randomness  property  of  the  runs  in  the  sequence.  This  structure  is  a 
specific  case  of  the  property  that  each  arc  in  a  walk  defined  by  a  class  of  complete 
cycles  is  equally  likely  to  be  taken  along  an  appropriate  digraph  Bn- 

A  new  efficient  algorithm  for  the  generation  of  classical  de  Bruijn  sequences  is 
developed  in  Chapter  VI.  The  algorithm  uses  only  3n  bits  of  storage  to  generate  a 
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de  Bruijn  cycle  of  length  2".  Although  this  algorithm  adjoins  cycles  in  the  usual  way, 
the  class  of  sequences  generated  is  different  from  that  found  by  previous  algorithms. 


B.  OPEN  QUESTIONS 

The  scope  of  potential  research  in  the  area  of  generalized  de  Bruijn  cycles  is 
vast.  The  very  fundamental  question  of  the  existence  of  nonclassical  de  Bruijn  cycles 
for  each  value  of  n  remains  to  be  answered.  A  general  formula  describing  the  size  of 
the  set  of  nonclassical  sequences  continues  to  be  an  elusive  problem. 

The  formulation  of  an  algorithm  to  generate  a  class  of  complete  cycles  for  each 
value  of  n  would  be  a  significant  contribution  in  this  area  of  research. 

There  are  many  different  (Z,n)-combs  available  to  extract  distinct  n-tuples. 
Further  analysis  on  specific  combs  would  be  a  valuable  extension  of  the  work  done  in 
Chapter  III. 

Proving  the  conjecture  in  Chapter  IV  on  the  value  of  the  Good  -  de  Bruijn 
digraphs  would  complete  an  important  contribution  to  the  anzdysis  of  double  Eulerian 
cycles  on  directed  graphs.  More  generally,  finding  the  appropriate  edge-factor  of  a 
digraph  to  generate  the  value  of  a  graph  would  be  a  very  nice  result. 

The  class  of  sequences  generated  by  the  algorithm  in  Chapter  VI  has  not  been 
fully  examined.  The  relation,  if  any,  between  these  sequences  and  sequences  generated 
by  previously  algorithms  would  be  an  interesting  result.  Determining  the  designated 
vertex,  representative  vertices,  and  the  distance  function  to  generate  the  prefer  ones 
sequences,  for  example,  could  provide  insight  into  a  new  algorithm. 
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These  are  but  a  few  of  the  promising  research  areas  that  arise  from  the  chal¬ 
lenge  of  understanding  complete  sequences.  A  thorough  understanding  of  nonclassical 
de  Bruijn  sequences  is  an  endeavor  to  find  structure  in  randomness,  a  statement  that 
might  be  make  of  every  research  endeavor! 

This  appears  to  be  the  first  study  of  complete  sequences,  and  it  portends  to 
be  a  rich  area  of  continuing  research. 
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APPENDIX  A.  SUMMARY  OF  SOME 
PREVIOUS  WORK  ON  CIRCUITS  IN 
DIRECTED  GRAPHS 


“/n  all  things,  there  is  a  law  of  cycles” 

Tacitus 

The  following  is  a  brief  summary  of  the  derivation  of  Theorem  III. 3.  This 
theorem  allows  one  to  easily  determine  the  number  of  ways  to  traverse  each  arc  of 
an  Eulerian  digraph  exactly  <7  times.  The  development  follows  the  work  by  van 
Aardenne-Ehrenfest  and  de  Bruijn  [Ref.  12]. 

Let  Sm  be  the  permutation  group  (or  symmetric  group)  on  m  elements.  If  5 
is  a  subset  of  5m  then  the  number  of  elements  of  5  of  order  m  is  denoted  by  ||5||  and 
the  total  number  of  elements  in  5  by  |5|.  A  subset  P  of  Sm  is  called  a  V-set  (in  Sm), 
whenever  it  has  the  property  that  |lsP||  has  the  same  value  for  all  s  G  Sm* 

Let  k  and  n  be  natural  numbers,  emd  take  m  =  Am.  Consider  the  set  Em  of 
m  elements,  divided  into  k  systems,  each  containing  n  objects.  The  set  H  denotes 
the  subgroup  of  Sm  consisting  of  all  k\{n\)'‘  permutations  with  the  property  that  if 
h  ^  H  then  ha  and  hb  belong  to  the  same  system  whenever  a  and  b  belong  to  the 
same  system.  In  other  words,  H  transforms  systems  into  systems. 

The  following  theorems  from  [Ref.  12]  sure  stated  with  proof. 
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Theorem  A.l  (van  Aardenne-Ehrenfest  and  de  Bruijn)  H  is  a  T>-set 


in  Sm- 

Let  R  denote  the  set  of  all  permutations  in  5m  with  the  property  that  the  n 
objects  of  each  system  are  transformed  into  objects  of  n  different  systems.  In  other 
words,  if  r  €  i?  where  a  and  b  belong  to  the  same  system,  then  ra  and  rb  belong  to 
different  systems.  It  can  also  be  shown  that  R  is  a.  V-set  in  5m. 

Let  D  =  (V,  A)  be  a  regular  Eulerian  digraph  where  in(x)  =  out(x)  =  a, 
Vx  6  V.  Two  cycles  are  considered  identical  whenever  the  arcs  of  the  first  cycle  are 
a  cyclic  permutation  of  the  arcs  of  the  second. 

The  number  of  EuV  riui  cycles  in  D  is  denoted  by  1|D||.  A  permutation  P 
of  the  set  of  arcs  of  A  =  (ai,...,a,)  is  called  conservative  (with  respect  to  D),  if 
Poi  =  Qj  always  implies  that  a,  is  adjacent  to  aj.  Let  £„  be  an  arbitrary  but  fixed 
Eulerian  cycle  in  D.  The  set  of  all  conservative  permutations  of  D  can  be  represented 
as  BEo,  where  B  —  Bi  x  ...  x  B\v\  is  the  group  of  all  permutations  where  Bi  is 
the  group  of  permutations  that  permute  arcs  having  the  same  initial  vertex  Vi,  but 
leave  invariant  all  other  arcs.  Therefore,  any  Eulerian  cycle  determines  a  conservative 
permutation  and  any  conservative  permutation  determines  an  Eulerian  cycle.  Hence, 
||Z)||  =  \\BEo\\.  Let  B\, ...,  B\v\  be  subsets  of  Bi, . . .  ,H|v|  respectively,  where 

B  =  BiX...x  B\v\. 

The  set  of  Eulerian  cycles  defined  by  B,  denoted  b||^||,  are  the  cycles  corresponding 
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to  a  permutation  bEg  where  6  €  B  and  Eg  is  a  fixed  conservative  permutation.  Using 
this  notation,  we  have 


llDlli=8llDll,  B\\D\\^\\BEg\U. 

If  for  each  i,  Bi  is  a  P-set  in  Bi,  then  B  is  called  normal.  The  following  result 
provides  a  relationship  between  the  set  of  all  conservative  permutations  and  a  subset 
of  consvervative  permutations. 

Theorem  A.2  (van  Aardenne-Ehrenfest  and  de  Bruijn)  If  B  is  normal, 
then 

fliipii  »iig|i 

\B\  ~  |B|  ’ 

where  |B1  and  |B1  denote  the  number  of  elements  of  B  and  B,  respectively. 

Recall  that  D  =  (V,i4)  is  an  Eulerian  regular  directed  graph  with  jUI  vertices 
and  |/4|  arcs.  If  in(t;,)  =  <7  for  1  <  j  <  |U1,  then  |A|  =  <t|V|. 

Let  A  be  a  positive  integer.  Then  by  we  denote  the  graph  that  arises  from 
D  if  we  replace  any  arc  (x,y)  by  A  arcs  (x,y).  We  see  that  £)(a)  has  |U|  vertices  and 
AjA)  =  A<r|V|  arcs.  The  set  of  A  arcs  arising  from  each  arc  in  D  are  said  to  form  a 
bundle. 

We  consider  3  categories  of  cycles  that  traverse  every  arc  in  D(\).  Each  of 
these  categories  is  normal  in  the  sense  described  above. 

1.  Bi  =  Bi,  where  Bi  is  the  permutation  group  of  order  ((A<r)!). 


117 


2.  Bi  =  Hi,  where  Hi  is  the  subset  of  B,  that  associates  bundles  with  bundles. 


That  is,  if  a  and  b  are  in  the  same  bundle,  then  ha  and  hh  are  in  the  same  bundle 
where  h  €  Hi.  The  order  of  Hi  is  <r!(A!)‘^. 

3.  Bi  =  Ri,  where  Hi  is  the  subset  of  Bi  that  associates  the  arcs  of  each  bundle 
with  <7  different  bundles.  That  is,  if  a  and  b  are  in  the  same  bundle,  then  ra  and  rb 
sje  in  different  bundles  where  r  €  /2,.  The  order  of  Ri  is  |f?|. 

We  have  by  Theorem  A. 2 

bIIAa)II  wIIAa)II  _  fillAA)ll 

((A<r)!)l''l  (^r!)l''l{A!)l^l  1^1  ‘  ^  ’ 

Recall  that  811/?(a)||  =  ||^(A)11i*  The  number  h||D(a)1|  is  proportional  to  the 
number  ||Z?||i.  It  can  be  seen  that  each  Eulerian  cycle  of  D  arises  from  A~‘(A!)I'^I 
different  cycles  in  lli5(A)||.  Hence, 

By  Equation  A.l  we  have, 

l|0,A)lli  = -'■'Ill’ll- (^)  -  (A-2) 

A  <7-cycle  in  Z?  is  a  cycle  containing  each  edge  of  D  exactly  a  times.  A  <7-cycle 
is  called  restricted  if  it  happens  that  any  pair  of  adjacent  edges  of  D  appears  just  once 
in  the  cycle.  The  number  of  different  (7-cycles  can  be  determined  from  Equation  A. 2. 
A  difficulty  arises  from  the  fact  that  a  <7-cycle  may  be  periodic  of  period  \E\d,  where 
d  is  a  divisor  of  <t.  Let  c(p)  denote  the  number  of  ^cycles  in  D  with  the  period  \E\p. 
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For  each  ^cycle  in  D  there  arise  Eulerian  cycles  in  D(p).  It  follows  immediately 


that 

=  E  (A.3) 

From  the  Mobius  inversion  formula, 

=  (A.4) 

d\i>  P 

and  the  number  of  unrestricted  />-cycles  equals 


5:c(<i)  =  (A.5) 

dip  P  <<1^ 

where  is  Euler’s  totient  function.  Evaluating  ||E>(p)l|i  using  Equation  A. 2,  the 
number  of  unrestricted  p-cycles  in  a  «T-regular  directed  graph  D  =  (V,  A)  is  given  by 


where  0  is  the  Euler’s  totient  function  and  the  summation  is  extended  over  all  divisors 


of  p. 
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APPENDIX  B.  A  DIGRAPH  ISOMORPHIC 

TO  Bn 

A  friend  is  ...a  second  self. 

Cicero 

In  the  Good  -  de  Bruijn  digraph  Bn  each  vertex  is  labeled  with  a  unique 
binary  n-tuple.  There  is  an  arc  (x,y)  from  vertex  x  =  (i05a;i,. . .  ,x„_i)  to  ver¬ 
tex  y  =  (yo,yi,...,yn-i)  in  Bn  if  and  only  if  (xi,X2,- •  •  ,a:n-i)  =  (yo,yi, •  •  ■  ,yn-2)- 
Equivalently,  we  can  label  each  vertex  x  =  (xi, X2, . . . ,  Xn-i)  with  the  unique  integer 

n-l 

i  that  is  the  decimal  representation  of  x  defined  by  i  =  ^  There  is  an  arc 

jsiO 

from  vertex  i  to  vertex  j  if  and  only  if  j  €  {2i,2i  +  1}  mod  2". 

We  define  Bn-(k+i)  for  0  <  fc  +  1  <  n,  to  be  a  2-regular  directed  graph  with 
2n-(fc+i)  vertices  and  2"“*  arcs.  We  label  each  vertex  in  with  a  unique  2’^'^^ 

element  set 

{2*+^i,2*+^i  -I- 1, . . .  +  2<*+*)  -  1}  where  0  <  i  <  -  1. 

There  is  am  arc  from  the  vertex  {2*'*''t,  2*'*’*i  -f  1, . . . ,  —  1}  to  the  vertex 

{2*+»i,  2*+'i  +  1, . . . ,  2*+>i  -I-  2(*+‘)  -  1}  in  if  and  only  if 

2i/  mod  2”  €  {(2*+*j),  (2*+^  +  !),•••> 2*+^j  -I-  2<*+*)  -  1}, 

where  1/  €  {2‘'+^i, 2'‘'‘’^i  +  1,  • . . , -f-  2^''+^^  -  1}. 
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Let  V{Bn)  be  the  set  of  vertices  in  B„  where  the  vertices  are  represented  by 
the  integers  i  where  0  <  t  <  2"  -  1.  Let  V(Bn-(k+i))  denote  the  set  of  vertices  in 
Bn-(fc+i)-  Furthermore,  let  A(B„)  and  A(B„)  denote  the  set  of  arcs  in  and 
respectively. 

Lemma  B.l  The  function  4>:  V{Bm)  -*■  V {Bn-(k+i))  for  m  =  n  —  (fc  +  1),  is 
defined  by  the  rule 

t;<^=  +  +  _  i)  mod  2". 

Then  <i>:  ViBm)  V{Bn-{k+i))  is  one  to  one. 

Proof:  Let  i,j  e  V{Bm),  with  i  ^  j.  Then 

i(f>  =  {2''+‘t,2''+^t  +  l,...,2*+‘t  +  2<*+*)-l}mod2’‘,  and 
j<t>  =  {2''+‘i,2*+^j  +  l,...,2*-*'V  +  2<*''*'^^-l}niod2". 

Assume  i4>  =  j<f>.  Since  2”  >  2"^,  it  follows  immediately  that  i  =  j.  Therefore, 

(t>  is  1-1.  ■ 

Lemma  B.2  For  the  rule  <i>  as  defined  in  Lemma  B.l,  <i>:  V{Bn)  — >  V{B„-(k+i)) 
is  onto. 


Proof:  Let  {2*+^i,2''+^i  -I- 1, . . .  ,2''+^i  -b  -  1}  mod  2"  6  V(B„).  Then 
{2*+^i,  -hi,...,  2*+‘i  +  mod  2"  =  i(^).  ■ 


Lemma  B.3  The  function  0:  A(B„)  A{Bn-{k-^-i))  for  m  =  n  -  {k  +  1),  is 

defined  by  the  rule 

{hj)e=  ({  2*+‘i,2''+‘t  +  l,...,2‘+^i  +  2<‘+‘>-l}mocl2", 

{  2*+‘i,  2*+*i  +  1, . . . ,  2*+*>  +  2<*+‘)  -  1}  mod  2"), 

where  (t,  j)  €  A{Bn)  if  and  only  if  ii{d>)yjid>))  €  y4(Bn-(fc+i)).  Then  0: 
A{Bn)  -►  A{Bn-(k+i))  is  one  to  one. 

Proof:  Let  (*',/)  €  A(Bn).  with  (ij)  {i'J').  Then 

(ij)e  =  ({  2*+^i, 2^+U  +  1, . . . , 2*+^’  +  2<*+*>  -  1}  mod  2", 

{  2*+‘ j,  2*+^  j  +  1, . . . ,  2*+^  j  +  2<^+^)  -  1}  mod  2") 

and 

({  2''+ii',2*+‘i'  +  l,...,2*+‘t'  +  2<''+^)-l}, 

{  2*+^ j', 2*+‘/  +  1, . . . , 2*+‘/  +  2^*+*)  -  1}  mod  2"). 

Assume  =  (i',j')0.  It  follows  immediately  that  (i,j)  =  (i',  j').  There¬ 
fore,  is  1-1.  ■ 

Lemma  B.4  For  the  rule  6  as  defined  in  Lemma  B.3,  6:  A(B„)  ->  A(B„_(ib+i)) 
is  onto. 
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Proof:  Let 


({  2‘+'i,2*+‘t  +  1,. . .  -  1}  mod  2", 

{  2‘+'j,2*+‘;  +  1,. . . ,2‘+‘j  +  2<*-""  -  1}  mod 2“)  € 

Then 

({  +  l,...,2*+‘i  +  2<*+‘)  -  1}  mod  2", 

{  2*+*i,2''+‘ j  +  1, . . .  -  1}  mod  2")  =  {ij){9). 


Theorem  B.5  The  digraphs  Bm  and  IBn-ik-^i)  for  m  =  n  -  (Ar  +  1),  are  iso¬ 
morphic. 

Proof:  The  function  <f>:  V{Bn)  ->  V"(B„_(fc+i))  defined  by  the  rule 

v4>  =  {2*+‘u,2''+^v  +  1, . . .  ,2*+^n  +  -  1}  mod  2" 

and  the  function  9:  A{Bn)  ->  A{Bn)  defined  by  the  rule 

(i,j)e=  ({  2*+*i,2*+‘i  +  l,...,2*+^t  +  2<''+*)-l}, 

{  2*+‘ j,  2^+^  j  +  1, . . . ,  2*+^  j  +  -  1}  mod  2”) 

are  bijections.  It  follows  by  definition  that  the  digraphs  and  Bn^^k+i)  are 
isomorphic.  ■ 
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o 

{04,2,3,4,5,6,7} 

11 

{8,9,10,11,1243,14,15} 

O 

^4-(2+l) 


o 

{0,1, 2, 3} 


{4,5, 6,7} 


{8,9,10,11} 


{12,13,14,15} 

O 

^4-(l+l) 


Figure  21.  Good  -  de  Bruijn  digraphs  Ba,  54_(o+i),^4_(i+i),  and  Ba-[2-^-i) 
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